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(57) Abstract: The invention provides a method, for evaluating a biological condition of a subject, comprising: a. obtaining from the 
subject a sample having at least one of RNAs and proteins; b. deriving from the sample a first profile data set, the first profile data set 
including a plurality of members, each member being a quantitative measure of the amount of a distinct RNA or protein constituent 
in a panel of constituents selected so that measurement of the constituents enables measurement of the biologkal condition; and 
c. producing a calibrated profile data set for the panel, wherdn each member of the calibrated profile data set is a function of 
a conesponding member of the first profile data set and a corresponding member of a baseline profile data set for the panel, the 
calibrated profile data set providing a measure of the biological condition of the subject 
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SYSTEMS AND METHODS FOR CHARACTERIZING A BIOLOGICAL 
CONDITION OR AGENT USING CALIBRATED 
GENE EXPRESSION PROETLES 

Technical Field 

A method is provided for identifying reprodudbje patterns of variation 
of gene expression tiiat are informative by virtue of tiie degree of variation 
observed in a calibrated data set. The variations may be correlated witii other 
noh- genetic indications such as clinical indicators (for huirtans) of a traditional 
nature but are not required per se to be causative. 

Backgroimd Art 

There has been substantial discussion including congressional hearings 
concerning medical errors. One source of medical errors include errors with 
medications. Upwards of 98,000 hospijtalized patients aimually have been 
documented to be victims of medication errors (Statement of the American 
Pharmaceutical Association to the Senate Appropriations Committee Labor, 
health and Human Services Education Subcommittee Hearing on Medical Errors 
December 13, 1999). These errors indude problems arising from drug 
interactioi\s for a particular patient taking more than one drug, problems 
concerning titie response of an individual to a particular drug and incorrect 
medication for a particular condition. Medical errors further arise as a result of 
misdiagnosis. This may occur as a result of insensitive diagnostic techniques or 
a wide range of interpersonal variability in the manner in which a clinical state 
is manifest. At present, there are few tools available for optimizing prognosis, 
diagnosis and treatment of a medical condition taking into account the 
particular phenotype and genotype of an individual. 

There has been increasing interest ki herbal drugs or nutraceuticals. 
These are often grown in developing countries and undergo litde or no quality 
control. It is frequently the case that one batch of a nutraceutical may be 
effective, there is no assurance th?it a second batch will be effective Moreover, 
analysis of nutraceuticals is problematic because these drugs are complex 
mixtures in which little is known with respect to the active agent. 
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All new therapeutic agents require some form of clinical trials. It is 
known that a drug for treating tumor that is tested in a clinical trial using 
standard recruiting techniques for patients, may in fact show only limited 
efBcacy. If the beneficial effect observed in a clinical population is too small, the 
drug will not receive approval by the Food and Drug Administration for use in 
fixe population at large. However, the small beneficial effect observed may in 
fact be an artifact of the clinical trial design or the clinical endpoint in the 
population of patients. It would be desirable to have criteria for screening 
patients as they enter a clinical trial to ensure that the beneficial effect of a drug 
if it exists may be detected and quantified. 

Summary of flie Invention 

In a first embodiment of the invention there is provided a method, for 
evaluating a biological condition of a subject, that inchides: obtaining bom the 
subject a sample having at least one of RNAs and proteins; deriving from Ae 
sample a first profile data set, the first profile data set including a plurality of 
members, each membgr being a quantitativemeasure of the amount of a distinct 
RNA or protein constituent in a panel of constituents selected so tiiat 
measurement of the constituents enables meastirement of ttie biological 
condition; and producing a calibrated profile data set for the panel, wherein 
each member of the calibrated profile data set is a function of a corresponding 
member of the first profile data set and a corresponding member of a baseline 
profile data set for the panel, the caMbrated profile data set providing a measure 
of tlie biological condition of tiie subject. 

In a preferred embodiment, a method is provided for evaluating a 
biological condition of a subject, that includes obtaining from, tiie subject a first 
sample having at least one of fluid, cells and acthre agents; applying the Bist 
sample or a portion ti\ereof to a defined population of indicator cells; obtaining 
from tiie indicator cells a second sample containing at least one of RNAs or 
proteins; deriving bom the second sample a first profile data set, the first profile' 
data set including a plurality of members, each member being a quantitative 
measure of the amount of a distinct RNA or protein constituent in a panel of 
constituents selected so that measurement of the constituents enables 
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measurement of the biological condition; and producing a calibrated profile data 
set for tihe panel, wherein each member of the calibrated profile data set is a 
function of a corresponding member of the first profile data set and a 
corresponding member of a baseline profile data set for the panel, the calibrated 
profile data set providing a measure of the biological condition of tiie subject. 

In a preferred embodiment, a method is provided for evaluating a 
biological condition affected by an agent, the metiKod including: obtaining, 
from a target population of cells to which the agent has been administered, a 
sample having at least one of RNAs and proteins; deriving from the sample a 
first profile data set, the first profile data set including a plurality of members, 
each member being a quantitative measure of the amoimt of a distinct RNA or 
protein constituent in a panel of constituents selected so that measurement of 
the constituents enables measurement of the biological condition; and 
producing a calibrated profile data set'for the panel, wherein eadii member of 
Ihe calibrated profile data set is a function of a corresponding member of the 
first profile data set and a corresponding member of a baseline profile data set 
for the panel, the calibrated profile data set providing a measure of the 
biological condition as affected by the agent. 

In a preferred embodiment, a mediod is provided for evaluating the 
effect on a biological condition by a first agent in relation to the effect by a 
second agent, including: obtaining, from first and second target populations of 
cells to which tihe first and second agents have been respectively administered, 
first and second samples respectively, each sample having at least one of RNAs 
and proteins; deriving from the first sample a first profile data set and firom the 
second sample a second profile data set, die profile data sets each including a 
plurality of members, each member bang a quantitative measure of the amount 
of a distinct RNA or proteki constituent in a panel of constituents selected so 
that measurement of the constituents enables meastirement of tiie biological 
condition; and producing for tiie panel a first calibrated profile data set and a 
second profile data set, wherein (i) each member of ti\e first calibrated profile 
data set is a fimction of a corresponding member of the first profile data set and 
a corresponding member of a first baseline profile data set for the panel, and (ii) 
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each member Of the second calibrated profile data set is a function of a 
corresponding member of the second proffle data set and a corresponding 
member of a second baseline proffle data set for the panel the calibrated proffle 
data sets providing a measure of the effect by the first agent on the biological 
condition in relation to the effect by the second agent. 

In a preferred embodiment, a method of conducting a clinical trial of an 
agent, is provided, including: causing tiie blind administration of a selected one 
of a placebo and tiie agent to each candidate of a pool of subjects; and using 
quantitative gene expression to monitor an effect of such adminish^tion. 

In a preferred embodiment, a digital storage medium is provided on 
which is stored a computer readable calibrated proffle data set, wherein: the ' 
caHbrated proffle data set relates to a sample having at least one of RNAs and 
proteins derived from a target cell population to which an agent has been 
administered; Hoe calibrated proffle data set inchides a first pluraUty of 

members, each member being a quantitative measure of a change in an amount 
of a distinct RNA or protein constituent in a prael of constituents selected so 
ttiat measurement of the constituents enables measurement of abiological 
condition as afiiected by administration of tiie agent. 

In a preferred embodiment^ a digital storage medium is provided on 
which is stored a plurality of records R, relating to a population of subjects, each 
record R, corresponding to a distinct instance P, of a computer readable proffle 
datasetPwherein: each instance?, of the proffle data set Prelates to a distinct 
sample derived from a subject, tite sample having at least one of RNAs and 

a plurality of members My each member 
M, being a quantitative measure of the amount of a distinct RNA or protein 
constitiient in a panel of constituents selected so that measurement of the 
constituents enables measurement of a biobgical condition; each record R, 

includes, for each member Aljof a corresponding distinct instance P, of the 
proffle data set P, a value corresponding to the value of tiie member and 
each record R, also includes a reference to a characteristic of tiie subject relative 
to the record, tiie characteristic being at least one of age group, gender, 
eflinidty, geographic location, diet, medical disorder, dinical indicator. 
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medication, physical activity, body mass, and environmental exposure. 

In a preferred embodiment, a digital storage medixxm is provided on 
which is stored a large number of computer readable profile data sets, wherein 
each profile data set relates to a sample derived from a target cell population to 
which has been administered an agent, tiKe sample having at least one of RNAs 
and proteins; each profile data set includes a plurality of members, each member 
being a quantitative measure of the amoimt of a distinct RNA or protein 
constituent in a panel of constituents selected so that measurement of tiie 
a>nstituents enables measurement of a biological condition; and the panel is the 
same for all profile data sets. 

In a preferred embodiment of the invention, a method is provided for 
evaluating a biological condition of a subject, based on a sample from the 
subject, the sample having at least one of RNAs and proteins, the method 
including: deriving from the sample a first instance of a profile data set, the 
profile data set including a plurality of members, each member being a 
quantitative meastire of the amoimt of a distinct RNA or protein constituent in a 
panel of constituents selected so tiiat measurement of the constituents enables 
measurement of fl\e biological condition; and producing a first instance of a 
calibrated profile data set for the panel, wherein each member of an instance of 
the calibrated profile data set is a function of a corresponding member of an 
instance of the profile data set and a corresponding member of an instance of a 
baseline profile data set for the panel, tiie calibrated profile data set providing a 
measure of the biological condition of the subject; accessing data in a condition 
database, the condition database having a plurality of records relating to a 
population of subjects, each record corresponding to a distinct instance of the 
calibrated profile data set; and evaluating the first instance of tiie calibrated 
profile data set in relation to data in the condition database. 

In a preferred embodiment of the invention, a method is provided of 
displaying quantitative gene expression aiwlysis data associated with 
measurement of a biological condition, the method including: identifying a first 
profile data set pertinent to the gene expression analysis data, the first profile 
data set includiiig a plurality of members, eiich member being a quantitative 
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measure of the amount of a distinct RNA or protein cx>nstituent in a panel of 
constituents selected so that measurement of the constituents enables 
measurement of the biological condition; producing a calibrated proffle data set 
for the panel, wherein each member of the calibrated proffle data set is a 
function of a corresponding member of the first profile data set and a 
corresponding member of a baseline proffle data set for ttie panel, the calibrated 
proffle data set providing a measure of the biological condition of the subject; 
and displaying the calibrated proffle data set in a graphical format. 

A preferred embodiment is directed to a descriptive record of a change in 
a biological condition, that includes: a first set of numerical gene expression 
values for a panel of gene lod, each value in the set corresponding to a single 
gene locus in a panel of gene lod, the set of values forming a profile data set for 
a population of cells subjected to a first biological condition; a second set of 
numerical gene e>q)ression values for &e panel of gene lod, each value in tiie set 
corresponding to a single gene locus, the set of values forming a baseline proffle 
data set for a second population of cells subjected to a second biological 
condition, the second set of values optionally being an average for multiple gene 
e3q)ression values from multiple populations of cells for each locus in the panel; 

and a third set of numbers corresponding to the ratio of the first set of values 
and file second set of values wifli respect to each gene locus in tiie panel, the 
third set being a caUbrated proffle data set; the proffle data set and the calibrated 
proffle data set being descriptive of the first biological condition with respect to 
the second biological condition- 
In a preferred embodiment, a metiiod for diagnosing a biobgical 
condition of a subject is provided that includes: obtaining a sample from a 
subject; subjecting a population of cells to the sample and determining the 
presence of a first biological condition with respect to a second biological 
condition according to any of the above daims. 

In a preferred embodiment, a method is provided for diagnosing a 
susceptibility for a biological condition in a subject, that includes obtaining a 
sample from the subject; creating a descriptive record, according to the above, 
wherein the baseline set of values is an average of second values contained in a 
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library of descriptive records for ttie second biological condition; the library 
containing a pltirality of descriptive records grouped according to a 
predetermined biological condition; comparing the calibrated profile data set of 
the subject with the library of calibrated profile data sets and diagnosing the 
suscept&ility of the subject. 

In a preferred embodiment, a metiiod is provided for monitoring the 
progress of a biological condition, including: creating a plurality of descriptive 
records, according to the above; wherein each set of first values is determined at 
preselected time intervals with respect to the first record; comparing each 
calibrated profile data set with a library of calibrated profile data sets, the 
plurality of calibrated profile data sets being grouped according to a 
predetermined biological condition; and determining ttie progress of the 
biological condition with respect to gene e?q)ressiorL 

In a preferred embodiment, a method is provided for establishing the 
biological activity of a composition, including: selecting a population of cells; 
subjecting the cells to the composition; and determining the record according to 
the above description using a standardized baseline profile data set for flie 
biological conditioiu 

In a preferred embodiment, a method is provided for determining which 
therapeutic agent fix>m a choice of a plurality of therapeutic agents to administer 
to a subject so as to change a biological condition in a subject from a first 
biological condition to a second biological condition; including: subjecting a 
sample firom the subject to each of a plurality of therapeutic agents; determining 
a descriptive record for each of the samples according to any of the above 
described methods, comparing each of the calibrated profile data sets to a 
Ubrary of calibrated profile data sets, the library of calibrated data sets being 
grouped according to a predetermined biological condition; and determining 
which of tile tiierapeutic agents is capable of changing the first biobgical 
condition in the subject to the second biological condition in the subject. 

In a preferred embodiment, a method is provided for characterizing the 
biological effectiveness of a single batch of a composition produced by a 
mantifacturing process, comprising: providing a fingerprint or signature profile 
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according to any of the above methods; and labeling the batch of flie 

composition by pladng the fingerprint (signature proffle) on eadi container in 
tiie batch. 

In a preferred embodiment, a method is provided for accessing biological 
information on a digital storage medium as described above, including: making 
ti»e information available to a user. 

In a preferred embodiment, a method is provided for consumer 
evaluation of a product, wherein the consumer evaluation is dependent on a 
signature profUe, including: identifying the product using the signature profUe. 

In a preferred embodiment, a computer program product is provided for 
evaluating a biological condition of a subject or for evaluating a biological 
condition resulting from the use of an agent, inchiding a computer usable 
medium having computer readable program code thereon, the computer 
program code; including: a program code for classifying a sample from the 
subject or tiie agent for an identifiable record; a program cbde for deriving a first 

data set, the first profik data set indu<Ung a pluraKty of inembers, eadi me^ 
being a quantitative measure of the amount of a distinct RNA or protein 
constituent in a panel of constituents selected so that measurement of the 
constihients enables measurement of the biological condition; the profile data 
set being stored in the record; and a program code for optionally producing a 
calibrated profile data set for the panel, for storage in the record, eadi member 
of the calibrated profile data set being a function of a corresponding member of 
the first profile data set and a corresponding member of a baseline profile data 
set for the panel, the calibrated profile data set providing a measure of the 
biological condition of the subject 

In a prefened embodiment of the invention, a computer system for 
evaluating a biological condition of a subject or for evaluating a biological 
condition resulting fix>m the use of an agent is provided, the computer system, 
including: a classification module for classifying a sample from the subject or 
the agent in an identifiable record; a derivative module for deriving a first data 
set, the first profile data set including a plurality of members, each member 
being a quantitative measure of the amount of a distinct RNA or protein 
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constituent in a panel of constituents selected so that measurement of tiie 
constituents enables measurement of the biological condition; and a production 
module for producing a calibrated profile data set for the panel, wherein each 
member of the calibrated profile data set is a function of a corresponding 
member of the first profile data set and a corresponding member of a baseline 
profile data set for the panel, the calibrated profile data set providing a measure 
of the biological condition of the subject. 

In a preferred ^nbodiment a metitod is provided for analyzing a patient 
for a biological condition at a remote site, including: providing a kit for 
measuring a profile data base for evaluating a biological condition, the kit 
including reagents for quantitative analysis of RNA or protein for a panel of 
gene lod; accessing a centralized database containing baseline profile data sets 
corresponding to the panel; determining the calibrated profile data set for the 
patient; and analyzing the biological condition of the patient. 

Preferred embodiments of the invention indude the use of calibrated 
profile databases for determining the biological condition at one site in a subject 
from a sample taken from a second remote site. The biological condition may 
include disease, therapeutic interventions, aging, health conditioning and 
exercise, exposure to toxins, status of infection and health status. For example, 
calibrated precision profiles may be used to measure a biological condition(s) in 
one site (for example, the liver) by sampling cells firom the same subject, but at a 
different site not generally considered a target for the biological condition, for 
example, peripheral blood cells in the case of liver disease. 

Preferred embodiments of the invention include tiie use of calibrated 
profile databases for determining the biological condition of the subject that 
includes placing a cell or fluid sample on indicator cells to assess the biological 
condition, the biological condition including disease, therapeutic interventions, 
aging, health conditioning and exercise, exposure to toxins, status of infection 
and health status. 

Preferred embodiments of tite invention include tiie use of calibrated 
profile data bases and profiles to assess, compare and contrast tiie bioactivities 
of therapeutic agents and therapeutic agent candidates including comparison of 
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two agents having unknown properties ; comparison of agents that are complex 
mixtures against those that are simple mixtures and comparisons of a single 
agent against a class of agents. 

Preferred embodiments of the invention include Ae use of calibrated 
profile databases derived from in vitro dosing of an agent in indicator cells, or 
fluids or cells ex vivo to predict in vivo activities, activities including efficacy and 
toxicity and further permitting data on short term m dosing of agent to 
predict long-term activities as described herein. 

A preferred embodiment of the invention is at least one databases and its 
uses, the databases containing at least one of calibrated profile data sets and 
baseline profile data sets for discrete populations identified according to factors 
including diseases, geography, eflmidty, age and state of health. 

A preferred embodiment of the invention is a database corresponding to 
an individual over time, the uses including managing a personalized health care 
program. 

Additional embodiinents indude methods of nmning a clinical ^ 
calibrated profile data and databases containing calibrated profile data torn in 
vitro and in vivo studies of the e^ of the agent on populations of cells and 
methods of building a clinical research network that uses calibrated profile data 
and traditional medical data. 

Brief Description of the Drawing s 

The foregoing features of the invention will be more readily understood 
by reference to the following detailed description, taken with reference to the 
accompanying drawings, in which: 

Figure 1 is a diagram showing fte flow of information from data acquired 
in molecular pharmacology and toxicology, dinical testing, and use of the data 
for the application to individualized medicine. 

Figure 2 is a diagram showing the drug discovery pathway of new 
compounds from early leads to likely drug candidates. Although calibrated 
profile data sets are indicated at the pre-clinical step, gene expression data can 
be acquired and is useful at any of the stages shown. IND refers to investigative 
new drug and refers to an early stage in regulatory review. 
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Figure 3 is a diagram presenting a comparison of in vivo and in vitro 
protocols for forming calibrated profile data sets for rapidly assessing product 
candidate toxicity and efficacy in accordance with several embodiments of the 
present invention. 

Figure 4 is a diagram showing the application of gene expression 
profiling as a guide to pre-dinical and clinical studies in accordance with an 
embodiment of the present inventioiL 

Figure 5 is a diagram showing a method in accordance with an 
embodiment of the present invention for obtaining profile data in flie absence of 
a stimulus and in the presence of a stimtilus. 

Figure 6 is a diagram showing the creation of a library of profile data 
associated witti a plurality of subjects in accordance with an embodiment of the 
presait invention. 

Figure 7 is a diagram illustrating the structure of a profiOie data record in 
accordance with an embodinient of the present invention. 

Figure 8 is a diagram illustrating a data entry screai for a data record of 
the type shown in Figure 7 and typical cont^ in which data records may be 
compiled in accordance with embodiments of tiie present invention. 

Figure 9 sho\^^ an embodiment of the present invention in which profile 
data; in either the raw or calibrated form, is evaluated using data from a 
database tiiat is remotely accessed over a network 

Figure 10 shows a schematic of a phase two clinical trial that utilizes gene 
expression profiling (a) The right hand panel (b) indicates that the same 
information may be used in Phase IV or post marketing studies to compare the 
efficacy of already approved and marketed drugs or to guide the marketing of 
such ttierapies; to guide tiie choice of tiierapy for an individual subject or 
population from within a dass of appropriate compounds. 

Figure 11 is a bar graph tiliat shows a graphical representation in the form 
of a histogram representing calibrated profile data sets based on quantitative 
expression of RNA in cells of a whole blood sample using a panel of 12 
constituents where each constituent corresponds to a uiuque gene locus, (a) The 
blood sample is stimulated ex vivo with heat killed staphylococci are further 
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exposed H7-TPCK, H9-UT-77, or H16-Dex as indicated. Ihe baseline profile 
data set is a blood sample stimulated ex vivo (in vitro) with heat killed 
staphylococci (b) The blood sample is stimulated ex vivo with 
lipopolysaccharide (LPS) and is then further exposed to conqwunds H7-TPCK, 
H9-UT-77, or H16-Dex as indicated. 

Figure 12 is a bar graph with a logarithmic axis that shows a graphical 
representation of calibrated profile data sets for whole blood stimulated ex vivo 
with Upopolysaccharide (LPS), using a panel of 9 constituents, each constituent 
corresponding to a gene locus encoding the geiie products indicated, the blood 
being further exposed to anti-inflammatoiy agents: methotrexate, 
medofenamate and methylprednisolone. Ihe baseline profQe data set is derived 
from LPS stimulated (but otherwise untreated) cells. 

Figure 13 are bar graphs witii a logarithmic axis that shows a graphical 
representation of calibrated profQe data sets for two difierent samples of whole 
blood (a) 991116 and (b) 991028 reflecting the biological condition of the cells 
using a panel of 24 members, each member corresponding to a gene locus, the 
baselineproffledatasetbeingderived&<HnuntreatedcelIs. The cahT,rated data 
sets for cells exposed for six hours to three inflammation inducing agents 
(Upopolysaccharide, heat kiUed staphylococci, and phytohemaglutinin) are 
compared for each sample, (c) shows a direct comparison of LPS stimulated 
991116 with respect to 991028 as the baseline profile data set (d) shows a direct 
comparison between unstimulated 991116 and 991028. 

Figure 14 is a bar graph with a logarithmic axis that shows a graphical 
representation of calibrated profile data sets using a panel of 22 constituents, 
each constituent corresponding to a gene locus, the baseline profile data set 
being derived from untreated cells. Whole bk)od is es^Kjsed for sbc hours ex rtw 
to three inflammation indudng agents (Kpopolysaccharide, heat killed 
staphylococci, and phytohemaglutinin) which are then treated with a single 
anti-inflammatory agent (methyl prednisolone) to reveal similarities and 
differences in the effect of a single agent on cell populations differing in their 
biological condition. 
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Kgure 15 is a bar graph witt\ a logarithmic axis that shows a graphical 
representation of calibrated profile data sets for whole blood where one 
calibrated data set refers to a subject (subject 2) who has been treated in vivo 
with a corticosteroid (dexamethasone), a second data set refers to the treatment . 
of a blood sample from the same subject prior to in vivo treatment where that 
sample has been treated ex vivo (in vitro) and the third data set refers to a second 
subject treated in vivo witit dexamediasone (subject 1). The data sets 
demonstrate the reproducibility and predictability of an ex vivo {in vitro) 
treatment of blood compared to in vivo treatment with the same agent The 
figure also shows minor variation between samples from different svibjects 
reflecting interpersonal variability. A panel of 14 constituents is provided. The 
baseline profile data set is derived from tmtreated whole blood from the cognate 
subject. 

Figure 16 is a bar graph with a logarithmic y axis that shows a graphical 
representation of calibrated profile data sets for whole blood where one 
calibrated data set refers to (a) 2 subjects who have been treated m vivo with an 
inactive placebo for 3 days and (b) active prednisolone for 3 days at 100 
mg/day* The data set shows some variation between samples from different 
subjects treated with the same drug. The data sets demonstrate similarity of 
responses across the same gene lod, as well as, quantitative variation at other 
lod suggesting quantifiable interpersonal variation. A panel of eight members 
is provided. The baseline profile data set is derived from untreated whole 
blood. 

Figure 17 is a bar graph with logarithmic y axis that shows a graphical 
representation of calibrated precision profile data sets for two samples taken 
from a single subject wititun a 19 day period using a panel (e.gv inflammation 
panel) of 24 members where each member corresponds to a unique gene locus. 
The basdine profile data set relates to peripheral blood taken from the subject 
prior to treatment. 

Figure 18 (a-e) are bar graphs with a logarithmic axis that show a 
graphical representation of calibrated profile data sets for each of 5 subjects from 
which a blood sample has been taken. Each of the blood samples was exposed 
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to the inflanunatoiy agent phytohemagglutin (PHA) or to a therapeutic agent 
(anti-inflammatory agent) at different concentrations: O.ljiM, 0.3uM, 1\jM, S^M 
and SjiM, for a 4 hour period ex vivo {in vitro) so as to determine the optimum 
dose for treating the subject A panel of 6 constituents were used corresponding 
to 6 gene locL The baseline profile data set was untreated sample obtained from 
the cognate donor. 

Figure 19 is a bar graph with a logarithmic axis that shows a graphical 
representation of calibrated profile data sets for three different subjects having 
different biological conditions using a panel with 24 constituents. Ihe profile 
data sets show variabiUty according to these conditions providing the basis for a 
diagnostic signature panel, (a) shows a calibrated profile data set for a smoker 
against a baseline for a non-smoker, (b) shows a calibrated profile data set for a 
subject with chronic obstructive pulmonary disease against a baseline for a 
subject lacking this disease. The baseline profile data set is derived ftom a 
subject that is "normal" witii respect to these conditions. 

Figure 20 illustrates fliat an individual responses can be distinguished 
from a similarly treated population. A comparison of the response of a single 
animal compared to its e)^)erimental cohort (n=i5 animals) witii respect to a 
single locus (GST-P) is provided. The baseline data set is the cohort average. 
The figures shows that tiiis animal varied significantly from the daily, 
population average in the first two days of the study, but became more similar 
to tiie cohort average with time after treatment with acetaminophen. 

Figure 21 is a bar graph with a logarithmic axis tiiat shovra a graphical 
representation of calibrated profile data sets for samples of blood treated ex vivo 
witii LPS or LPS and one of three anti-inflammatory herbals (Echinacea, Arnica 
or Siberian Ginseng) at a concenteation of 200 ug/mL A panel of 24 constituents 
is used. The baseline profile data set is derived fix>m LPS stimulated cells absent 
a herbal treatment. Hie figure illustrates the effectiveness of the use oiF the 
calibrated precision profile to investigate the overall effects of complex 
compounds such as nutraceuticals whose biological effect is a summation of 
more than one activity. In this case, each of the herbals is consumed as an 
immunostimularit, however Ae calibrated precision profiles reveal a unique 

14 

SUBSTITUTE SHEET (RULE 26) 



wo 01/025473 



PCTAJSOO/17846 



pattern shows a mixture of both immunostimtilatory and anti-mflammatory 
effects. 

Figure 22 is a bar graph with a logarithmic axis that shows a graphical 
representation of calibrated profile data sets for samples of blood treated ex moo 
with LPS or LPS and methylprednisolone or LPS and Arnica. The baseline 
profile data set is LPS treated blood sample. 

Figure 23 is abar graph with a logarithmic axis that shows a graphical 
representation of calibrated profile data sets for samples of THP-1 cells treated 
with LPS or LPS and Amica.at three difiierent concentrations using a panel of 22 
constituents. The baseline profile data set is untreated THP-1 cells. The figure 
illustrates a concentration response with respect to the gene e)q)ression across 
the calibrated profile. 

Figure 24 is a bar graph with a logarithmic axis that shows a graphical 
representation of calibrated profile data sets for samples of THP-1 cells treated 
ex vivo with four different commercial brands of Echinacea using a panel of 8 
constituents. The baseline profile data set is untreated THP-1 cells. 

Figure 25 illustrates the use of the calibrated profile to compare relative 
efficacy across brands, or different formulations. Calibrated profile data sets for 
herbal preparations from different manufecturing sources with respect to an 
indicator monocytic cell line (THP-1) are shown graphically, the baseline profile 
data set being THP-1 cells absent the herbaL (a) three commercial herbal 
Echinacea preparations at 250 (ug/ml); (b) three herbal preparations at different 
concentrations (250ug/Eal, 50ug/ml and 3-lOug/ml) (c) fotir commercial 
Echinacea brands at 250 ug/ml). 

Detailed Description of Specific Embodiments 

As used ia this description and the accompanying claims, the following 
terms shall have tiie meanings indicated, unless the context otherwise requires: 

A ''collection of celfe" is a set of cells, wherein the set has at least one 
constituent. 
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A "population of cells" includes one or more ceUs. A population of cells 
may refer to cells m WW or torn Win) cultures. In wfro cultures may include 
organ cultures or cell cultures where ceU cultures may be primary or continuous 
cellculturesofeukaryoticorprokaryoticceUs. CeU lines can be primary 
cultures or ceU samples, e.g. from a tumor, from blood or a blood fraction, or 
biopsy explants from an organ, or can be established cell lines or microbial 
strains. 

A ''region of tiie subject" from which proteins are obtained may be (but is 
not required to be) the same part of the subject from which has been obtained a 
collection ofcellsorapopulationofrells. The cells and the proteins may both 
be obtained from blood of the subject, for exan^le. Alternatively, for example, 

thecellsmaybeobtainedfrombloodandtheproteinsmaybeobtainedfroma 
scraping of tissue or vice verca. Similarly, the proteins may be obtained from 
tirine of ttie subject, for example, whereas the cells may be obtained elsewhere, 
as, for example, from blood. 

A"panel" of genes is a set of genes mduding at least two constituents. 
A "normative" condition of asubjectto whom a composition is to be 

administered means the addition ofasubject before administration, even if the 
subject happens to be suffering from a disease. 

An "expression" of a gene includes the gene product whether messenger 
RNA or protein resulting from franslation of the messenger RNA. 

A 'Oarge number" of data sets based on a common panel of genes is a 
number of data sets sufficiently large to permit a statistically significant 
conclusion to be drawn with respect to an instance of a data set based on flie 
same panel. 

A "biological condition" of a subject is the condition of the subject in a 
pertinent realm that is under observation, and such reabn may include any 
aspect of the subject capable of being motiitored for change in condition, such as 
health, disease including cancer; frauma; aging; infection; tissue degeneration; 
developmental steps; physical fitness; obesity, or mood. As can be seen, the 
conditionsmaybechronicoracuteorsimplytransient Moreover, a targeted 
biological condition may be manifest throughout the organism or population of 
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cells or may be restricted to a specific organ (such as skin, heart, eye or blood). 
The term '^biological condition'' includes a "physiological condition". 

The 'l)lind administration" of a selected one of a composition or placebo 
to a subject in a clinical trial involves administering the composition or placebo 
to the subject in accordance with a protocol pursuant to which the subject lacks 
knowledge whether flie substance admmistered is ttie composition or a placebo. 

An "organism'' is any living cell including microorganisms/ animals and 
plants. An animal is commonly in this context a mammal, but may be a 
vertebrate non-mammal, as e.g., a zebra fish, or an invertebrate, as, e.g. 
Caenorhabditis elegans. 

An "agent" is a composition or a stimulus. A "stimultis" may include, for 
example ultraviolet A or B, or light therapy for seasonal affective disorder, or 
treatment of psoriasis with psoralen or treatment of melanoma with embedded 
radioactive seeds, otihier radiation ecposxire, etc. A "composition" includes a 
chemical compound, a nutraceutical, a combination of compoimds, or a complex 
mixture. 

A "clinical indicator" is any physiological datum used alone or in 
conjunction with other data in evaluating the physiological condition of a 
collection of cells or of an organism. This term includes preclinical indicators. 

A "signature panel" is any panel representing a subclass of constituents 
where tiie subclass of constituents is selected according to the relatively high 
level of information concerning a biological condition imparted by each member 
of the data set. 

"Distinct RNA or protein constituent" in a panel of constituents is a panel 
tiiat includes at least one of RNA and protein and each constituent of the panel 
is distinct. 

A preferred embodiment of the invention is the formation of calibrated 
data sets that describe a biological condition or an effect of an agent on a 
biological condition. A calibrated data set represents a set of values tiiat 
correspond to variations in gene expression where the variations are 
informative. This approach does not require comprehensive analysis of all gene 
expression in target cells associated with a particular conditioru Nor is any one 
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Single gene locus necessarily of particular significance. Rattier a pattern of 

variation (a profile) is sought that correlates, in a reproducible manner, with a 

particular condition. Iliere may be no a pnon knowledge of a correlation but 

rather a .correlation may be established by evaluating a panel of constituents of 
reasonable size (for example up to 100 constituents) and iteratively testing the 

gene e;q>ression profiles for different subjects or for the same subject from whi A 
the most informative lod for a particular condition may be selected. An 
informative subgroup of constituents in a panel may be selected that 
consistently vary for a particular condition and this subgroup may then become 

the signature panel, the signature panel giving rise to a signature profile. 

In hirther embodiments of the invention, any calibrated data set for an 
individual that has more members than reflective of a single signature panel 
may be mined for calibrated profiles that correspond to additional signature 
panels thereby potentially providing new insi^ into medianisms of action of 
abiological condition onsets of genes. Measui«nent of changes in transcribed 
RNA m a cdl as a result of an environmental change or aging is an exqu^ 
sensithremeasureoftheresponseofacelL Techniques available today to • 
quantify transcribed KNA in a cell add to the sensitivity of the approach. The 
preferred embodiments of the invention, which are directed to patterns of 
change in amounts of transcribed RNA, provide a means to focus and interpret 
this rich information- 

In contrast to the above approach, much attention in the prior art has 
been directed to the sequencing of the human genome and the identification of 
all the genes encoded therein. Accompanying the growing amount of sequence 
data, microarrays provide a means to survey thousands of gene sequences for 
mutations. Microarrays are being used to provide DNA profiles that identify 
mutations in an indhndual and those mutations will be associated with 
predictions concerning development of disease in those individuals. 
Transcriptomicsandproteomicsisnowthefocusofincreasingattention. These 
studies are directed to analyzing the entire body of RNA and protein produced 
by Uving cells. NCcroarrays provide a method for analyzing many thousands of 
di&renthumanRNAsas to whethertheyareexpressed and bywhich cells. For 
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example, a project undertaken by the National Cancer Institute and others to 
examine mRNAs produced by varioiis types of cancer cells, have revealed 
50,000 genes that are active in one or more cancers. The goal of these studies is 
to identify novel cancer drugs that are directed to knocking out or enhancing tiie 
production of certain proteins. ( Kathryn Brown, The Human Genome Business 
Today, Scientific American, July 2000, p^; Julia Karow, The "Other" Genomes, 
Scientific American, Jtdy 2000, p.53; Ken Howard, 'The Bioinf ormatics Gold 
Rush, Scientific American, Jtily 2000, p.58; Carol Ezzell, Beyond the Human 
Genome, Scientific American, July 2000, p.64; all incorporated by reference). 
Major efforts in correlating genetic variation of individuals and the functional 
interrelationships of genes in health and disease are being conducted in a 
variety of consortia including the single nucleotide pol5rmorphism consortixmi 
and the Human Epigenome Consortium (Beck et al. Nature BioTechnology 17 
(1999) p 1144). The Epigenome Consortium plans to anal3^e sets of genome 
fragments from both healttiy and diseased individuals in the 500 different 
human tissues (Bioworldbitemationak December 22, 1999). This approach 
seeks to correlate absolute expression of genes associated with a particular 
condition witii the presence of that conditioiu Examples of prior art that seek to 
measure gene expression in absolute amoimts including by subtractive methods 
or by determining amounts with respect to housekeeping genes or by targeting 
a single gene expression system are U.S. 5,643,765; U.S. 5,811,231; U.S. 5,846,720; 
U.S. 5,866,330; U.S. 5,968,784; U.S. 5,994,076; WO 97/41261; WO 98/24935; WO 
99/11822; WO 99/44063; WO 99/46403; WO 99/57130; WOOO/22172 and 
WOOO/11208. 

We have taken a different and novel approach to the above by identifying 
reproducible patterns of variation of gene expression that are informative by 
virtue of the degree of variation between a sample and a baseline for example a 
subject with the condition arid the subject fliat lacks flie condition. The 
variations may be correlated witti other non- genetic indications such as clinical 
indicators (for htmians) of a traditional nature but are not required per se to be 
causative. Accordingly, the amovmt of gene expression product (for example 
RNA traitscript) produced by a gene locus in a cell tmder certain circumstances 
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is measured and then stored as a value in a first profile data set. IWs value is 
calibrated with respect to a second value (a baseline profile data set) to provide 
a member of a caHbiated profile data set. The values recorded for the profile 
data set, relying on a particular baseline data set to produce a calibrated data set 
become part of the descriptive record any or all of which can be stored in a 
database which may be accessed through a global network such fliat any new 
data in the form of a profile data set or a caKbrated profile data set measured at 
any global location can be directly compared to an archive of descriptive records 
including caKbrated profile data sets and baseline data sets so as to extend the 
stored library of profiles and provide predictive or diagnostic data about a 
particular biological condition or agent. 

We have exemplified tiie use of selected panels of constituents 
corresponding to gene lod firom which quantitative gene expression is measure 
by for example quantitatively measuring the transcribed RNA in a sample of a 
subject, for appMcations that include: (a) measurement of therapeutic efficacy of 
natural or synthetic compositions or stimuli that may be fonnulated 
individually or in combinations or matures for a range of targeted physiological 
conditions; (b) predictions of toxicological effects and dose effectiveness of a 
composition or mbcture of compositions for an individual or in a population; (c) 
determining how tvvo different agents administered in a single treatment might 
interact so as to detect any of synergistic , additive, negative, neutral of toxic 
activity (d) performing pre-dinical and clinical trials by providing new criteria 
for pre-selecting subjects according to informative profile data sets for revealing 
disease status and conducting preliminary dosage studies for these patients 
prior to conducting phase 1 or 2 trials. Gene expression profiling may be used 

to reduce the cost of phase 3 clinical trials and may be used beyond phase 3 
ttals; (e) labeling for approved drugs; (f) selection of suitable medication in a 
dass of medications for a particular patient tiiat is directed to their unique 
physiology; (g) diagnosing or detennining a prognosis of a medical condition or 
an infection which may precede onset of symptoms or alternatively diagnosing 
adverse side effects associated witii administration of a tiierapeutic agent; (h) 
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managing the health care of a patient; and (i) quality control for different 
batches of an agent or a mixture of agents* 
The Subject 

The methods herein can be applied to a subject tiiat includes any living 
organism where a living organism includes a prokaryote such a bacterium or a 
eukaryote including single celled eukaryotic organisms at one end of the 
spectrum and htunans at the other and everything in between including plants. 
The figures relate to calibrated profile data sets obtained from humans and 
mammals. Nonetheless, the methods disclosed here may be applied to cells of 
other organism without the need for imdue experimentation by one of ordinary 
skill in the art because all cells transcribe RNA and it is know in the art how to 
extract RNA from all types of cells. 

A tissue sample might include a single cell or multiple cells or fragments 
of cells. Body fluid includes blood, urine, q)inal fluid, lymph, mucosal 
secretions, hemolymph or any other body fluid known in the art for a subject. 
For an animal subject, a tissue or fluid sample may be obtained by means of a 
biopsy needle aspirate, a lavage sample, scrapings and surgical incisions or 
ottier means known in the art 
Panels 

Steps in selecting constituents in a panel include searching publicly 
available medical literature for RNA or proteins or sets of RNAs or proteins that 
directly or indirectly vary with a particular biological condition. A panel 
containing up to 100 constituents may be selected. According to the condition 
being examined, just a small subset of the panel constituents may be 
informative. In determining membership of the panel of genes, it is not ' 
necessary for die panel to be an exhaustive selection. Rather it is desu^ed to 
obtain from tiie panel an expression profile that discriminates consistently with 
respect to the targeted physiological or biological condition. Moreover, a panel 
is not necessarily selected according to an expected profile of gene expression in 
cells tiiat directly respond to a biological effect. For example, gene expression 
associated with liver metabolism may be analyzed in a blood sample. Figures 20 
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and 22 provide calibrated profiles of whole blood treated with herbal agents 
using markers for livermetabolism. 

The number of constituents in a panel can vary. According to the 
examples provided below, panels of up to 24 genes are selected for evaluating 
expression levels. Although a panel may be as large as 100 constituents, it is 
desirable for a particular panel to have no more than 24 constituents, more 
particularly, less than 12 constihients. For example, subsets of no more than 8 
genes have been used that may be derived from a larger panel but which are 
sufficiently informative to effectuate discrimination. Ihe number of 
constituents in a panel for which expression is monitored may vary widely 
depending on thecontext: For example. Figure 1 describes data acquisition 
from in vitro cell culture and from animal toxicology shidies, which includes 
expression of about 25 to 100 or more genes. In contrast, selection of markers or 
surrogate markers indude for example three to 100 genes, preferably five to 50 

or five to 25 genes to be analyzed from samples obtained in clinical shidies. In 
this manner markers or surrogate rna-kas haying predktive value for a medical 
condition, such as a genetic predisposition, a response to therapeutic agent, an 
inflammatory condition, or an infection, etc can be identified and cumulatively 

largerpopulations can be obtained to refine the correlations. A health profile 
can then be generated for an individual subject using a low volume blood 
sample. The blood sample can be analyzed for expression profile data of about 
100 - 500 genes, comprising markers or surrogate markers of a number of 
medical conditions (Fig. 1: right panel). Panels of varying sizes may be utilized 
as necessary and subsequent refinements in methodotogy may lead to selection 

of subsets having panels as large as 15 genes or 12 genes or as small as 6, 5, 4, 3 
or 2 genes. 

It is envisaged that any single biological condition may be described by a 
signature panel having a smaU number of highly informative constihients 
providing a signature calibrated profile (also referred to as a fingerprint). The 
presence of highly informative lod is demonstrated in several of the 
accompanying figures. For example. Figure 11(a) n-2, n-4 and U-5 appeared to 
be highly informative. Highly informative constihients in Hgtire 21 include the 
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interleukins. The signature panel may provide a signature profile or fingerprint 
which is sufficiently robust to serve as a standard in describing a particular 
biological condition or an effect of a particular agent on a biological condition. 

For purposes.of illustrating a signature panel, constituents of a panel for 
measuring inflammation have been provided that are informative witii respect 
to a particular biological condition. For example, we have used a panel for 
inflammation that has 6 constituents- Il-la, 11-6, 11-8, 11-18, GMCSF and IFN-g in 
Figure 18(aHe) to determine the response of 5 subjects to varying 
concentrations of drugs. This group of constituents is a subset of a larger panel 
of inflammation related gene lod such as shown in Figxire 19a and Figure 19b 
where the Inflammatory Panel includes Il-a, Il-b, 11-2, 11-3, 11-4, 11-6, n-7, 11-8, 11-10, 
Il-12p40, n-15, n-15, 11-18, GM-CSF, Ifn-gamma, TGF-b, cox-2, ICE, MMP-9, 
ICAM, TNF-a and TNF-b. The subset of constituents were selected on the basis 
of the information sought concerning die biological conditiorL 

Embodiments of the invention provide examples of at least 4 different 
panels which may be used separately or together. These panels are an 
inflammatory panel (INF-a, Il-lb, ICAM, 11-8, 11-10, Il-12p40, ICE, cox-2, cox-1 
and mmp-3) a cell growth and differentiation panel (c-fos, c-jtm and STATS), a 
toxicity panel (SOD-1, TACE, GR, HSP70, GST, c-fos, c-jun, INOS) and a Uver 
metabolism panel (INOS, cyp-a and u-pa). Other panels include skin response or 
prostate cancer or endothelial/cardiovascular response panels or cell growth or 
differentiation or liver metabolism panels. Although provided as examples, the 
above panels are not intended to be limiting. 
Gene Expression 

For measuring the amotmt of a particular RNA in a sample, we have used 
methods known to one of ordinary skill in the art to extract and quantify 
transcribed RNA from a sample with respect to a constituent of a panel RNAis 
extracted firom a sample such as a tissue, body fluid, or culture mediimi in 
which a population of a subject might be growing. For example, cells may be 
lysed and RNA eluted in a suitable solution in which to conduct a DNAse 
reaction. First strand synttiesis may then performed xising a reverse 
transcriptase. Gene amplification, more specifically quantitative PCR assays, 
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can then conducted and the gene of interest size calibrated against a marker 
such as 18S rRNA (Hirayama et al.. Blood 92, 1998: 46-52). Samples are 
measured in multiple dupUcates for example, 4 repUcates. Relative quantitation 
of the mRNA is determined by the difference in threshhold cycles between the 
size marker and the gene of interest In an embodiment of the invention, 
quantitative PCR is performed using amplification, reporting agents and 
instruments such as those suppKed commercially by PE Biosystems (Foster City, 

CA). Givenadefinedefficiencyofamplificationoftargettranscripts,thepoint 
(e.g., cyde number) that signal from amplified target template is detectable may 
be directly related to the amount of specific message transcript in the measured 
sample. Similarly, other quantifiable signals such as fluorescence, enzyme 
activity, disintegrations per minute, absorbance, etc., when correlated to a 
known concentration of target templates (e.g., a reference standard curve) or 
normalized to a standard with limited variability can be used to quantify the 
number of target templates in an unknown sample. 

Although not limited to amplification methods, quantitatwe gene 
expression techniques may utilize amplification of the target tr^jscript. 
Alternatively or in combination with amplification of the target transcript, 
amplification of the reporter signal may also be used. Amplification of the 
target template maybe accomplished by isothermic gene amplification 
strategies, or by gene amplification by thermal cycling such as PCR. It is 
desirable to obtain a definable and reproducible correlation between the 
amplified target or reporter and tiie concentration of starting templates. 

It is envisaged that techniques in the art using microfluidics for example 

andhighlysensitivemarkerswillenablequantitationofRNAtooccurdirectfy •• 
from a single cell or lysed cell. Hiis may rely on amplification of a marker but 
may not require amplification of tiie transcripts themselves. The amount of 
transcript measured for any particular locus is a data point or member of the 
first profile data set for a particular panel 

According to embodiments of the invention, a first profile data set is 
derived from the sample, the first profile data set including a pluraUty of 

members, each member being a quantitative measure of the amount of a RNA 
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transcribed from a gene locus, the gene locus being a constituent in a panel of 
constituents. A first profile data set may be obtained from a quantitative 
measure of the amoimt of a distinct RNA or protein corresponding to a gene 
locus. The figures provided here are directed to RNA. However, the method 
could be applied using proteins where sensitive quantitative techniques are 
available for measiuring Hie amount of a distinct protein in a cell. 
Baseline Profile Data Sets 

The analyses of samples from single individuals and from large groups of 
individuals provide a library of profile data sets relating to a particular panel or 
series of panels. These profile data sets may be stored as records in a library for 
use as baseline profile data sets. As the term ''baseline" suggests, the stored 
baseline profile data sets serve as comparators for providing a calibrated profile 
data set that is informative about a biological condition or agent It is 
anticipated that many baseline profile data sets will be stored in libraries and 
classified in a number of cross-referential ways. One form of classification 
might rely on the characteristics of tiie panels from which the data sets are 
derived. Another form of classification might be the use of a particular 
biological conditiorL The concept of biological condition encompasses any state 
inwhichacellorpopxilationof cells might be at any onetime. This state mig^t 
reflect geography of samples, sex of subjects or any otiier discriminator. Some 
of title discriminators may overlap. The libraries might also be accessed for 
records associated with a single subject or particular clinical trial. The 
classification of baseline profile data sets may further be aimotated with medical 
information about a particular subject, a medical condition, a particular agent 
etc 

The choice of a baseline profile data set for creating a calibrated profile 
data set is related to the biological condition to be evaluated, monitored, or 
predicted, as well as, the intended use of the calibrated pand, e.g., as to monitor 
drug development, quality control or other uses. It might be desirable to access 
baseline profile data sets from the same subject for whom a first profile data set 
is obtained or firom different subject at varying times, exposures to stimuli, 
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drugs or complex compounds; or may be derived from like or dissimilar 
populations. 

The profile data set may arise from the same subject for which the first 
data set is obtained, where the sample is taken at a separate or similar time, a 
different or similar site or in a dififerent or similar physiological condition. For 
example, Kgure 5 provides a protocol in which the sample is takai before 
stimulation or after stimulation. The profile data set obtained ftom the 
unstimulated sample may serve as a baseline profile data set for the sample 
taken after stimulation. The baseline data set may also be derived from a Kbrary 
containing profile data sets of a population of subjects having some defining 
characteristic or biological condition. Hie baseline profile data set may also 
correspond to some ex vivo or in vitro properties associated with an in vitro ceU 
culture. The resultant calibrated profile data sets may then be stored as a record 
in a database or library (Figure 6) along with or separate from the baseline 
profile database and optionally the first profile data set although the first profile 
data set would normally become incorporated into a baseline profile data set 
imder suitable classification criteria. 

Selected baseline profile data sets may be also be used as a standard by 
which to judge manufacturing tots in terms of efficacy, toxicity, etc Where ti\e 
effect of a tiierapeutic agent is being measured, ttie baseline data set might 
correspond to gene expression profiles taken before administration of the agent. 
Where quality control for a newly manufachired product is being determined, 
the baseline data set might correspond with a gold standard for that product. 
However, any suitable normalization techniques may be employed. For 
example, an average baseline profile data set is obtained from authentic material 
of a naturally grown herbal nutraoeutical and compared over time and over 
different lots in order to demonstrate consistency, or lack of consistency, in lots 
of compotinds prepared for release. 
Calibrated Data 

A calibrated profile data set may be described as a function of a member 
of a first profile data set and a corresponding member of a baseline profile data 
set for a given gene locus in a panel For example, calibrated profile data sets 
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may be derived by calculating a ratio of the amount of RNA transcribed for a 
panel.constituent in a cell sample in an environmental including intervention 
such as a therapeutic treatment or at a particular time (first profile data set) with 
respect to the amount of RNA transcribed for the same panel constituent in a 
cell that differs in some manner from the sample (baseline profile data set) 
(Figures 5 and 6). We have foimd that calibrated profile data sets to be 
reproducible in samples tiiat are repeatedly tested (Figure 17). We have also 
foimd that calibrated profile data sets obtained when samples from a subject are 
exposed ex vivo to a compoimd are comparable to calibrated profile data from a 
sample that has been exposed to a sample in vivo (Figure 14 and Figure 
16(a),(b)). We have also found that an indicator cell line treated with an agent 
can provide comparable calibrated profile data sets to those obtained from in 
vivo or ex vivo populations of cells (Figure 15). Moreover, we have found that 
administering a sample from a subject onto indicator cells can provide 
informative calibrated profile data sets with respect to the biological condition 
of the subject including tite health, disease states, therapeutic interventions, 
aging or exposure to environmental stimuli or toxins of die subject (Figure 25). 

A preferred use of a calibrated profile data set is to evaluate a biological 
condition of a subject. This may be for purposes of diagnosis or prognosis of a 
clinical disorder. It is desirable to obtain a calibrated data set that describes a 
state of health or alternatively a state of age or body mass or any condition or 
state that an individual subject might find themselves to be in. For example, tiie 
biological condition might relate to physical activity, conditioning or exercise, ' 
mental state, environmental fector such as medication, diet, or geography or 
exposure to radiation or environmental contamination or infectious agent, 
biological or environmental toxin. If health or conversely a clinical disorder is 
being evaluated, calibrated profiles data sets may be used for monitoring change 
in health status by periodic or regtdar comparison of profiles; the disorder may 
be a complex disease process possibly involving.multiple gene including 
inflammation, autoimmime disease, degenerative disease, allergy, vascular 
disease, ischemia, developmental disease, hormonal conditions and infectious* 
diseases. The clinical disorder may further include arthritis, asthma, multiple 
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sclerosis and perimenopausal changes. The biological condltionmay affect a 
system of a subject including a respiratory, vascular, nervous, metaboKc, 
urinary, reproductive, struchiral and immunological system or other metaboKc 
state. The above examples of a biological condition are given by way of 
illustration and are not intended to be limiting. 

Similarly, calibrated profile data sets can be used to measure, monitor or 
predict the host response to an infectious agent for purposes of identifying the 
infectious agent, assessing the duration of infection, the extent of exposure or 
making tixerapeutic decisions. 

The evaluation of activity of an agent may require a series of caUbrated 
profiles. It is here shown that calibrated profile data sets can be used to describe 
the biological activity of an agent that might be a single compound or a complex 
compound such as a nutraceutical or herbaL The agent can be assayed using 
indicatorcells,erwi»cellpopulationsorbymtatwadministration. Hieseassays 
may rely on a series of signature panels or enlarged panels for different 
biological conditions. Hie resultant calibrated profiles may then be used to infer 
likely in vivo activity fixan the m vitro study. Insights into toxicity and 
mechanisms of action can also be inferred firom calibration profile data sets. For 
example, the herbal Echinacea is beUeved to have both immunostimulatory and 
anti-infiammatory properties although neither has been measured 
systemmatically. We have provided a systematic approach to investigate the 
biological activities of these and other herbs. We investigated the alleged 
immunostimulatory properties of the herbs by comparing the effect of treating 
the indicator cell line THP-1 or peripheral blood cells witii the agent to 
untreated cells. Untreated cells indude LPS stimulated untreated cells. 
Untreated cells were used as a baseline profile data set to measure the difeience 
in gene expression between a baseline profile data set and the experimental 
treatment with the compound. Baseline profile data sets included a single 
sample or an average value firom a series of experiments. Hie resultant 
caKbrated profile data sets could then be compared with a library of caHbrated 
profile data sets for a particular herb or/and libraries associated with different 
agents or conditions. 



28 



SUBSTITUTE SHEET (RULE 26) 



wo 01/025473 



PCTAJSOO/17846 



From the information obtained about a previously undescribed agent, a 
signature panel may be derived optionally together with a signature profile to 
serve as a gold standard for testing other batches of the same agent* 
Calculation of Calibrated Profile Data Sets and Computational Aids 

The function relating the baseline and profile data sets is in a preferred 
embodiment, a ratio e)q>ressed as a logarithm. The calibrated profile data set 
may be expressed in a spreadsheet or represented graphically for example, in a 
bar chart or tabular form but may also be expressed in a three dimensional 
representatioiu Preferably the constituent is itemized on tiie x-axis and the 
logarithmic scale is on the y-axis. Members of a calibrated data set may be 
expressed as a positive value representing a relative enhancement of gene 
expression or as a negative value representing a relative reduction in gene 
expression with respect to the baseline. 

Each member of tiie calibrated profile data set should be reproducible 
within a range with respect to similar samples taken fir om the subject under 
similar conditions. For example, ihe calibrated profile data sets may be 
reprodudble within one order of magnitude with respect to similar samples 
taken firom the subject under similar conditions. More particularly, die 
mezxibers may be reproducible wititiin 50% more particularly reprodudble 
witihin 20%. Each member of the calibrated profile data set has a biological 
significance if it has a value differing by more tiuui an amoimt D, where 
D=5F(1.1)-F(.9) and F is a second function. 

It is the pattern of increasing, decreasing and no change in gene 
expression from the plurality of gene lod examined in the panel that is used to 
prepare a calibrated profile set that is informative with regards to a biological 
condition, biological efficacy of an agent treatment conditions or for comparison 
to populations and which may be used to identify likely candidates for a drug 
trial, iised in combiiiation with otiier dinical indicators to be diagnostic or 
prognostic with re^ct to a biological condition or may be used to guide the 
development of a pharmaceutical or nutraceutical throu^ manufacture, testing 
and marketing. 
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The numerical data obtained from quantitative gene expression and 
numerical data from calibrated gene expression relative to a baseline profile 
data set may be stored in databases or digital storage mediums and may 
retrieved for purposes including managing patient health care or for conducting 

clinical triak or for dtaraclerizing a drug. The data may be transferred in 
networks via the World Wide Web, email, or internet access site for example or 
by hard copy so as to be coUected and pooled from distant geographic sites 
(Figures). 

In a preferred embodiment, a descriptive record is stored in a single or 
multiple databases where the stored data includes the raw gene expression data 
(first profile data set) prior to transformation by use of a baseline profile data 
set, as weU as a record of the baseline profile data set used to generate the 
caKbrated profile data set including for example, annotations regarding whether 
the baseline profile data set is derived fi»m a particular signature panel and any 
other annotation that fadUtates interpretation and use of the data. 

Because the data is in a universal format, data handlingmay readily be 
done with a computer. Hie data is organized so as to provide an output 
optionally corresponding to a graphical representation of a calibrated data set. 
For example, a distinct sample derived firom a subject being at least one of RNA 
orproteinmaybedenotedasP,. The first profile data set consists of M, where 
■ is a quantitative measure of a distinct RNA or protein constihient. The 
record R, is a ratio of M and P and may be annotated with additional data on the 
subject relating to for example, age, diet, ethnicity, gender, geographic location, 
medical disorder, mental disorder, medication, physical activity,body mass and 
environmental exposure. Moreover, data handling may further include 
accessing data firom a second condition database which may contain additional 

medical datanotpresently held witiithecalibrated profile datasete. In this 
context, data access may be via a computer network. 

The above described data storage on a computer may provide the 
informationinaformthatcanbeaccessedbyauser. Accordingly, the user may 
load the information onto a second access site including downloading the 
information. However, access may be' restiicted to users having a password or 
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other security device SO as to proted: the medical records contam A 
feature of this embodiment of the invention is the ability of a user to add new or 
annotated records to the data set so the records become part of the biological 
information. 

The graphical r^resentation of calibrated profile data sets pertaining to a 
product such as a drug provides an opportunity for standardizing a product by 
means of the calibrated profile, more particularly a signature profile. The profile 
may be used as a feature with which to promote the drug. 

The various embodiments of tiie invention may be also implemented as a 
computer program product for use with a computer system. The product may 
include program code for deriving a first profile data set and for producing 
calibrated profiles. Such implementation may include a s^es of computer 
instructions fixed either on a tangible medium, such as a computer readable 
media (for example, a diskette, CD-ROM, ROM, or fixed disk), or transmittable 
to a computer system via a modem or otiier interface device, such as a 
commimications adapter coimected to a network over a medium. The medium 
may be either a tangible meditun (for example, optical or analog 
communications lines) or a meditun implemented with wireless techniques (for 
example, microwave, infrared or other transmission techniques). The series of 
computer instructions preferably embodies all or part of the functionality 
previously described herein with respect to the system. Those skilled in the art 
should appreciate that such computer instructions can be writta\ in a ntimber of 
programming languages for use with many computer architectures or operating 
sj^tems. Ftirthermore, such instructions may be stored in any memory device, 
such as semiconductor, magnetic, optical or other memory devices, and may be 
transmitted using any commxmications technology, such as optical, infrared, 
microwave, or other transmission technologies. It is e)q>ected that such a 
computer program product may be distributed as a removable medium with 
accompanying printed or electronic documentation (for example, shrink 
wrapped software), preloaded with a computer system (for example, on system 
ROM or fixed disk), or distributed from a server or electronic bulletin board 
over tiie network (for example, the Internet or World Wide Web). In addition, a 
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cx)mputer system is further provided including derivative modules for deriving 
a first data set and a calibration profile data set. 
Clinical Trials 

The use of calibrated profile data sets for perfonning dinical trials is 
illustrated in Figure 10 using the abovesiescribed methods and procedures for 
running a clinical trial or managing patient care. Moreover, standardization 
between laboratories maybe achieved by using a particular indicator ceU line 
such as THP-1 which is stimulated by a known stimulator such as 
Upopolysaccharide so that resultant profile acts as a measure that the laboratory 
is performing the protocol correc%. 

Examples of how embodiments of the invention may be used for 
augmenting clinical tiials includes providing new methods for patient selection. 
Clinical tiials in which candidate subjects are included or excluded accoixiing to 
a predetermined optimum calibrated profile for a given biotogical condition can 
resultinmorepredsemonitoringtiianwouldbeotherwisepossible. Itcanalso 
result in a greater efficien<y in dinical bial design because unsuitable patients 
that have, for example, compUcating factors or conditions can be screened out 
The calibrated proffle data will also enhance tiie "signal to noise" by removing 
nonrresponders fi»m double blind placebo shidies. The basic strucUire of a 
clinical tiial design using gene expression profiling could foUow any of several 
formats. Tliese include testing body fluid from a candidate patient in tiie trial cr 
vivo against a new therapeutic agent and analyzing ttie calibrated profiles with 
respect to an agent-treated and placebo-tireated samples using a predetermined 
panel and evaluating whetiier tiie candidate patient would be likely to respond 
without adverse effects to the composition being tested. In selected indications, 
profile data obtained from in vitro cell culhires or organ culhires may be desired 

where the cell originates from a target subject or from another subject or from an 
established ceU line, or fi»m a cell samples removed from the target subject 
where the ceU samples may be obtained from any body fluid including a blood, 
urine, semen, amniotic, or a cerebrospinal fluid sample, or from a scraping from 
mucosal membranes such as from tiie buccal cavity, tiie eye, nose, vagina or by 
means of a biopsy including epithelial, Uver, sternum marrow, testicular, or 
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from tumor tissue removed surgically from a tumor at any location. The above- 
described sources of samples are applicable to any medical use in which 
calibrated profile data sets are desired. 

In vitro dosage and toxicity studies using calibrated profile data sets 
obtained from indicator cell lines or samples of the patient tested ex vivo can 
provide useful information prior to initiation of die clinical trial and can 
significantly reduce the cost and time of a dinical trial while increasing the 
likelihood of identifying the presence of beneficial effect. Li particular, the dose 
can be optimized on an individualized basis to maximize the impact on 
therapeutic outcome. For example. Figure 12 shows how ex vivo blood cells 
respond to the stimulatory effect of LPS and the subsequent treatment with an 
anti-ii\flammatory drug (methotrexate, meclofenamate or methylprednisolone). 
The data show how the effect of methotrexate and meclofenamate generates 
similar calibrated profile data sets where the baseline is LPS treated blood. In 
contrast, the methylprednisolone has a substantially different effect from the 
other two compounds. Asimilar type of anal}^is can be performed with 
complex mixtures as illustrated in Figure 21 in which tiie calibrated profiles 
obtained when Echinacea, Arnica and Siberian Ginseng applied to LPS 
stimulated blood ex vivo are compared. In this example, all three agents appear 
to act differently from each other with respect to a sample from a single subject. 
Similar analyses can be used to compare compoimds with tmknown targets or 
activities or metabolic patterns to compotmds, complex or simple, with known 
or pre-determined profiles. 

The above methods and procedures can be utilized in the design and 
running of clinical trials or as a supplemental tooL Moreover, the above 
methods and procediures can be tised to monitor the patients' health as well as 
the patient's responsiveness to an agent before during and after the clinical trial 
This includes monitoring whether mtdtiple agents interfere with each oflier, act 
synergistically or additivdy or are toxic or neural with respect to each other. 
This type of information is very important as individuals take an increasing 
nimiber of medications. 
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Similarly, the methods and procedures described above may be used to 
manage patient care for an individual or a population. Such methods and 
procedures may also be used to develop a regional or global research network 
that uses caHbrated profile data sets and the rating databases to conduct 
research or trials. 

Both the caHbratian proffle data sets in graphical form and the associated 
databases together with information extracted from both are commodities that 

can be sold together or separately for a variety of purposes. For example, 
grapWc representations of calibration proffle data sets can provide a description 

of a product with respect to its activity Aat may be used to promote the 
product Altematively,thegraphicalformofthe calibrated proffle data setsand 
access to baseline proffle databases provide a means for manufacturers to test 
discrete batches of product against a gold standard. 

The data can be used strategically for design of clinical triab. It can also 
be useful for physicians practicing at remote sites to offer personalized 
healthcare to a patient. Accordingly, the physician might set up personalized 
databases for calibrated proffle data sets prior to and after treatment of a 
particular condition. New data on the subject could be added to the 
personalized database at each visit to tixe doctor. The data could be generated at 
remote sites by the use of kits that permit a physician to obtain a first proffle 
data set on a sample firom a patient For remote users to access the site, it is 

envisaged that secured access to the global network containing libraries of 
baseline proffle data sets and caKbrated proffle data sets, classified by particular 
criteria and representing data horn larger populations than a single individual, 
would be necessary. The access to the global database may be password 
protected thereby^rotecting the database firom corrupted records and 
safeguarding personal medical data. Hie graphical form provided by the 
calibrateddatasetsmaybeusedtocreatecatalogsofcompoundsina , 
pharmacopiae complete with toxic effects that mi^t arise for particular 
individuals as well as other types of drug interactions. 

Access to the global data base may include the option to load selected 
data onto a second acceiss site. This process could include downloading tfie 
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information to whatever site is desired by ttie user and could include securing 
hard copies of information. It is desirable to control how and what data is 
offloaded or dopied to maintain the integrity of the database. It is envisaged 
that while a global network of clinical data would be an informational resource, 
it would have utility is conducting research tiiat might include epidemiological 
studies and studies concerning the mechaiusm of action of an agent and studies 
concerning the nature of interpersonal variability as determined by calibrated 
profile data sets. 
Examples of Medical Uses 

(a) Early detection of infectious diseases- Markers or sxurogate 
markers from mice may be obtained for measuring gene expression in htmians 
that indicate early or immediate response to infection, for example, to a virus 
such as hepatitis virus, or to a bacterium such as Mycobacterium tuberculosis (the 
Gram-positive etiologic agent of tubercvdosis) (see Figure 4), Candidate genes 
are identified and changes in e}q>ression of those genes in the presence of a 
challenge provide a set of markers. The set of markers can combine markers 
encoded by the genome of the subject and one more distinctive markets 
encoded by the genome of tiie ii\fectious agent For example, changes in 
expression of an immediate early gene of a virus, e.g. a gene encoding an 
enzyme of viral replication, and a host gene such as the gene for any or aU of IL- 
2, TLr4 and IL-5, can comprise markers or surrogate markers for a medical 
condition capable of detecting that condition prior to the onset of medical 
sjrmptoms. This method affords earlier detection of an infection than is possible 
using current diagnostic techniques. 

(b) Toxicity profiles and mechanistic profiles obtained from an in vitro 
assay and in vivo assays. Toxicity and mechanistic information arising from the 
administration of a compound to a population of cells can be monitored using 
calibrated profile data sets. The following is an example of an experimental 
protocol for obtaining this information. Firstly, an experimental groups is 
established: (1) control cells maintained without therapeutic agent and without 
stimulus; (2) cells treated with therapeutic agent but without stimulus; (3) cells 
without therapeutic agent but with stimulus, (4) sample with therapeutic agent 
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and with Stimulus. The population of cells can be sdected from primary ceU 
cultures prepared in culture plates using mefliods weD established in the art; or 
mature differentiated ceU preparation from whole blood or isolated monocytes 
from the target organism, which in this example is mouse. 

Hie cells are stimulated so as to present a targeted physiological 

conditionbypretreatmentwithU«purifiedfromaGram-negativebacterium(a 
variety of LPS prq>arations from pathogenic bacteria, for example, from 

Salmmdla typhimurium and from Escherichia coli 01157:H7, are available from 

Sigma, St. Louis, MO). Ihe therapeutic agent administered to the cell sair^les in 

this example is an inhibitor of an enzyme known to be key in disease etiology, 

namely an inhibitor of a protease or a nucleic add polymerase. Following 
treatment by addition of the therapeutic agent and further incubation for four to 
sbc houi^, samples of the cells are harvested and analyzed for gene expression. 

Nucleic add, spedficaUyRNA,canbeprepared from the sampleby methods 
known to one or ordinary skill in the art (see, for example, the Lyse-N^~ 
J^agent, Pierce Oiem. Co:, Rodcford, IL). Samples are analyzed by QPCR 
according to a cjuantitative replicative procedure, (ijuantitathre polymerase 
chain reaction procedure (QPCR)) (see, for example, Gibson,U. 1996 Genome 
Res.6:995.1001, and references dted therdn). Total RNA was assessed using 
universalprimers. Toxidty of the agent for cells can be measured in untreated 
cells by vital stain uptake, rate of DNA synthesis (autoradiography of labeled 
nuddc compared to cells stained), stain by DNA-spedfic eyes (Hoed^t), etc 

Medianistic profQes can be determined by analysis of the identifies of & now 
up- or down-regulated genes. Further, in tiie presence of a therapeutic agent, 
some genes are not expressed, indicating potential efficacy of the therapeutic 
agentinsuppressingtheeffectsofstimulationbytheUPS. For example, in 

Figure21,levels of ICE that are somewhat stimulated in tiiepresenceof LPS 
+Ed»inacea are substantially depressed by LPS + Arnica relative to LPS 

stimulated cells absent agent Levels of HSP 70 whidi are depressed in the 
presence ofLPS+Ediinaceaaresubstantially stimulated in the presence of LPS 
+Amica, and LPS + Siberian Ginseng relative to LPS stimulated cdls absent the 
additionofanagent. I^elsof II.12p40whidi are sUghtly increased in the 
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presence of LPS + Echinacea are substantially depressed in the presence of LPS 
+ Arnica and LPS +Siberian Ginseng relative to LPS stimulatioiL In contrast to 
the above iising nutraceuticals. Figure 16 shows a much enhanced reduction of 
gene expression in whole blood for IL-la, Il-lb, 11-7, 11-10, IHL-15, IFN-g, TGF- 
b, TNF-b cox-2, and ICAM in the presence of prednisolone +LPS when 
compared to arnica +Lre or nothing +LPS. 

(c) Quantitation of gene expression in a blood cell to predict toxicity 
in another tissue or organ. 

Leukocytes can be obtained from a blood sample of a subject, for the 
purpose of assessing the appearance of a pathological condition in another 
organ, for example, the liver. A profile data set is obtained of genes expressed in 
the lexikocytes, for example, genes encoding a set of lyn^hokines and cytokines. 
The data set is compared to tiiat of ttie database, to examine correlations for 
example to other subjects, and to tiie subject prior to administration of a 
therapeutic agent. 

By this method, a correlation can be drawn between, for example, 
administration of acetaminophen (Tylenol) and sensitivity to this therapeutic 
agent and manifested by liver damage. An early prediction of therapeutic agent 
sensitivity, detected prior to the onset of actual damage to tiie liver, can be 
clinically available so that the subject receives no further administration of 
acetaminophen. The success of the database is the ability to detect a correlation 
or correlations prior to the onset of traditional medical assessments, such as 
increase in bilirubin level or other indication of liver pathology. 

(d) Calibrated profiles firom blood cells for prognosis of severity and 
prediction of adverse reactions in treatment of an autoimmune disease. 

The probability and timing of onset of symptoms of an autoimmune 
disease, for example, rheumatoid arthritis, may be monitored by appearance of 
expre^ion of markers or surrogate markers as determined by ttie methods of 
gene e^q^ression profiling of markers or surrogate markers and comparison to a 
profile database as described above. Thus an indication of imminent onset can 
be obtained, and advance management by utilization of preventive measures to 
forestall onset, can be taken. Further, the user can choose a set of potential 
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therapeutic agents, and assess for a given agent, the probabiUty that a subject 
wiU present an adverse reaction if given a fuU course of treatment, prior to that 
fuU course. For example, using embodiments of the invention, a single dose of 
the agent methotrexate can be administered to a subject having arthritis and in 
need of a therapeutic agent If the gene e)q)ression profile data set of the subject 
in response to a single dose of methotrexate correlates with data sets from 
subjects having adverse reactions to this agent, then administration of a fuU 
course of methotrexate is counterindicated. Conversely, if the gene e)q)ression 
proffle data set correlates with those of subjects who have responded positively 
to administration of a course of methotrexate treatment, then this therapeutic 

agent can be administered to the subject wifli much lower probabiUty of adverse 
reaction. 

Discussion of Fig ures -| 

Hgures 1-4 illustrate some of the appUcations of calibrated profile data 
sets. In Kgurel, three possible scenarios are provided. Firstty, a candidate 

therapeuticagentmaybetestedtodetermineitsmolecularpharmacologyand 
toxicology profiles. Ihe test might include obtaining calibrated profile data sets 
for a series of pands selected on flie basis of what activity is predicted for the 
drug. The population of cells exposed to the agent may be the result of in vivo 
administration as depicted by the mouse or direct exposure m vitro where the 
cells may be an indicator cell line or an ex vivo sample from the subject. The 
result of the screen is the identification of more effective drug candidates for 
testing in human subjects. 

Hie second scenario in Figure 1 is the use of calibrated profile data sets to 
identify a suitable clinical population for screening a potential flierapeutic agent 
Both demonstration of lack of toxicity and demonstration of clinical efficacy 
requirecertainassumptionsaboutthedinicalpopulation. The calibrated profile 
data sets provides a means for establishing fliose assumptions with respect to 
the biological condition of the individuals selected for the clinical trials. 

Tlie third scenario in Figure 1 is the opportunity to practice 
individualized medicine which may indude creating an archive of calibrate 
profile data sets on the individual in a state of health such that changes can be 
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identified using signature panek so as to permit prognosis or diagnosis of a 
particular condition. Moreover, stored information about the patient in the form 
of calibrated profile data sets permits selecting one of a group of possible 
therapeutic agents most like to be effective for the patient, optimizing dosage of 
drug and detecting adverse effects that might arise through drug interactions 
before symptoms arise. Theresultof the use of cahT^rated profile data sets is to 
provide more efficient and cost effective health care management. 

The novel approach described above for evaluating a biological condition 
of a subject may be applied to an ex vivo or in vitro assay for measuring the effect 
of an agent on a biological condition as illustrated in Figures 2-4. A sample from 
the patient may be measured directly ex vivo or tested ex vivo against an agent to 
predict an effect in the patient This provides a quick and effective way to 
determine which drug chosen from within a single dass of drugs fliat all may be 
used to treat a particular condition, may be most effective for a given subject 
Alternatively, an agent may be tested on an indicator cell line tiiat can provide a 
quantitative measure of therapeutic performance in a ckiss of individuals. 

Figure 2 illustrates how calibrated profile data sets may assist in 
screening a library of candidate compounds to discover candidate drugs. 
Starting with, for example, 500 candidate drugs, these can be tested in indicator 
cells or ex vivo body fluid or tissues against signature panels for in vitro 
toxicology or metabolic indicators. The figure illustrates the large number of 
compounds that entered in late stages in the development process only to 
ultimately be rejected due to adverse biological interactions. It is e)q>ected that 
early adoption of the use of calibrated profile data sets will more readily identify 
likely successful candidates and thereby reduce the expense and untoward 
e^cts of animal and human experimentation for compounds that could have 
been predicted to fail. 

Figure 3 describes multiple screens in which a comix>tmd roight be 
administered to an experimental animal such as a mouse or to an indicator cell 
line. The in vivo or ex vivo or indicator cell sample might further be treated with 
a stimulus. The result of both the compoimd and the stimulus could then be 
detected using to signature profiles for toxicity or for mechatiism to compare the 
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effect of no drug +/- stimulus or +/-drug and no stimulus. Both in vitro Geft 
panel) and in vivo (right panel) studies can be used to evaluate the effect of a 
compound (drug, nutiraceutical, environmental stimuli, etc.). The right hand 
panel also illustrates the specific embodiment of an "in vitro clinical trial", that 
is, treatment of cells obtained from a subject and treated witii a compound (with 
or wittiout a stimulus) in vitro (or ex vioo) in order to predict the outcome of 
similar ti-eatinent of the subject in vivo (see Fig. 15 for a specific example). The 

output from both paneb is described as toxicity and mechanistic profiles. Either 
experimental course may be used to botii evaluate potential toxicity, e.g., using 
the toxicity, or Uver metabolism panels, and to determine or confirm likely 
mechanism of action by a critical selection of a gene panel(s) that illustrates and 
differentiates molecular mechanisms of action (see Figure 12 for a specific 
example). 

Figure 4 illusb:ates a bioassay in which cells are removed from tiie subject 
and tested ex vivo with the addition of a compound and also a challenge or 

stimulus. The ex wco effect of stimulus and flien drug on whole blood taken 
fr»m a human subject is shown in Figure 12 in which the stimulus is 
lipopolysaccharide (an iriflammatory agent) while the drug is any of 
methoti«xate, medofenamate or methylprednisolone using a signatiire panel for 
inflammation. Methylprednisolone, a drug commonly used in the ti:eatinent of 
acute exacerbations of COPD as well as in the chronic management of this 
disease, is considered to be a potent by non-specific anti-inflammatory agent 
However, as demonstiated in Figure 22, its effects on gene expression are 
dependent on tiie stimulus. While there are general quaHtative similarities 
between the effects on gene expression across these tiuee stimuH, there are bofli 
quantitative and quaHtative differences that may be important in understanding 
when glucocorticoid intervention is warranted. 

Accordirig to embodiments of the invention, an indicator cell population 
is used to measure quantitative gene expression the effect of an agent or a 
biological sample may influence tiie choice of which indicator ceU line will be 
inost informative. For example, a cloned ceU line such as THP-1 or a primary 
cell population (peripheral mononuclear cells) may provide information that is 
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comparable to tiiat obtained from a body sample directly (see Figure 15). The 

normal state of gene expression may range from zero or few franscripts to 10* or 
more transcripts. 

Similarly, an agent may be evaluated for its effect on any population of 
cells, either in vivo, ex vivo or m vitro, by administering ttie agent and then 
determining a calibrate profile data set for fl\ose cells imder the selected 
conditions. Examples of this approach are provided in Figures 10-16 and 18. 
Hgure 18 furtiher provides calibrated profile data sets for different 
concentrations of a single agent showing that the transcription of selected 
constituents vary with dose and dierefore the anticipated effectiveness witii 
respect to the biological condition. 

The above description of determining a biological condition is 
exemplified as folbws. The action of a pharmaceutical or nutraceutical is 
measured with respect to its anti-inflammatory properties. The measurement of 
the effect may be established using a panel of constituent gene lod, for example, 
an inflammation panel, including, Interleukin 1 alpha QL-la) or Tumor Necrosis 
Factor a^ha (TNF-a). The anti-inflammatory efiect may first be established by 
treating indicator cells or sample cells ex vivo with a known inflammation 
inducers (for example, lipopolysaccharide or other mitogens) followed by 
treatment witti the experimental agent or condition expected to suppress or 
reduce the expresision from the appropriate gene lod. Accordii^ tihe baseline 
profile data set is the delta change in gene expression for a particular panel of 
constituents. The addition of a potaitial anti-inflammatory agent results in a 
second delta change tiiat is superimposed on a first delta change. This is 
illusfrated for example in Figure 12. Methylprednisotone has a substantial 
down regulation effect on ILr2 in blood cells stimulated ex vivo with LPS where 
the basdine data set is LPS stimulated cells. In tiiis case there is a negative delta. 
In contrast, IL-2 appears to be upregulated in whole blood not previously 
exposed to LPS, where the baseline data set is tinstimulated cells. (Figure 16b) 
This is consistent with the observation tiiat methylprednisolone stimulated IL-2 
production. 
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TliedetenninationofthebiologicdcoiuUtionofasubjectmayindude " 
measuring and storing additional data about the subject. For example, if the 
subject is a human or mammaUan patient, additional clinical indicators may be 
determined from blood chemistry, urinalysis. X-ray, other chemical assays and 
physical or sociological findings. 

Figure 7 illustrates how the accumulation of caUbrated profile data sets 
can improve the predictive power of the database and thereby increase its value 
in generating information about a biological condition or agent The figure 

indicates the use of the database in terms of its predictive power to,f6r example, 
predictthe course of a therapeutic intervention, follow the course of an 
individual subject compared to a population, prediction of a likely mechanism 
of metabolism or molecular mechanism of action or a comprehenshre database 

thataUows comparison ofasingle profile toacoUectionof signature, calibrated 
precision profiles. 

Referred embodimente of how the database may be used is provided in 
Figure 8. Figure 8 ilhjstrates display of a data profile set from the source 
database. Entries for input include a name, an Experimental Type, and whether 
the enfry is a New Reference; Cell/Tissue/Species and whether these are new; 
Therapeutic agent (compound). Dose, and additional parameters and whether' 
thetiierapeuticagentisnew Observations are recorded according to the 
identity of a Gene (New Gene) and a Protein (New Protein). The Stimulus or 
other Treatinent, if any, and the Dose are entered. Gene (and/or Piotein) 

Expression, Expression Value, ExpressionUnits if appropriate and Expression 
Timeareshown. The figure spedficaUyillusfrates the range of appUcable fields 
of investigation from complex natural products to clinical frails in humans, 
linkage to fraditional forms of measurement and evahiationsudias Uterahire 
citations, clinical indicators and fraditional pharmacokinetic measurements. 
Expert analysis of the precision profile data contained in the d atabase may then 
be used to guide product development and marketing, or used to improve the 
clinical decision making concerning the health of a single individual or 
population of individuals. 
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It is anticipated that one form of record might provide information about 
a subject or agent with respect to identity, medical history including traditional 
pharmaceutical/ medical data, clinical indications as determined from literature 
data, reference to additiorml types of analysis in tihe database, etc 

Figure 9 shows an embodiment of ti\e present invention in which profile 
data is evaluated using data from a database that is remotely accessed over a 
network. The figure illustrates that data are expected to be derived at one or 
more locations, compared using a central database and information obtained 
used to affect, for example, the course of treatment of an individual or 
population- The two-way nature of 1109 illustrates the iterative process 
whereby the database affects the course of treatment or development, and 
outcome or response to such intervention again becomes part of the database. 
In a first location, as in Figure 5, from a tissue sample procured in box 1101, 
there are derived multiple RNA spedes pursuant to box 1102, and then in box 
1103, profile data are quantified to produce a profile data set that is pertinent to 
the tissue sample obtained in box 1101. In order to evaluate the profile data set, 
in box 1104 information is retrieved from database 1108, which is located in a 
second location. In fact the database may be in communication with a large 
number of locations, each of which is generating profile data that must be 
evaluated. The retrieval of information from tiie database is accomplished over 
a network 1109, which may include the Internet, in a manner known in the art. 
Once information has been obtained from the database 1108, the iirformation is 
used in evaluating the quantified profile data in box 1105, with the residt in box 
1106 that the medical condition of the subject may be assessed. The database 
1108 is in box 1107 updated over the network 1109 to reflect the profile data that 
have been quantified in box 1103. In this manner the database 1108 may be 
updated to reflect the profile data obtained over all locations, and each location 
has the benefit of the data obtained from all of the locations. 

EXAMPLES 

Example 1. (a) Use of whole blood for ex vivo assessment of a biological 
condition affected by an agent. 

Human blood is obtained by venipimcture and prepared for assay by 
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aliquoting samples for baseline, no Stimulus, ami Stimulus wiA 
volumeforatleastthreetimepoints. IVpical stimufi include Upopolysaccharide 
(LPS), phytohemagglutinin (PHA) and heat-killed staphylococci (HKS) or 
cairageean and may be used individuaUy (typically) or in combination. Ihe 
aUquots of heparinized, whole bipod are mixed without stimulus and held at 
37-C in an atmosphere of 5% C02 for 30 minutes. Stimulus is added at varying 
concentrations, mixed and held loosely capped at ZTC for 30 min. Additional 
test compounds may be added at this point and held for varying times 

depending on the expected phannacokinetics of the test compound. At defined 

times, cells are coUected by centrifugation, the plasma removed and RNA 

extiracted by various standard means. 

^) Preparation of RNA for Mp;>«„rir,|n r Gen^ Byp ^^,-,.^ 

Nucleic adds, RNA and or DNA are purified from cells, tissues or fluids 

ofthetestpopulationorindicatorceUlines. RNA is preferentially obtained 

from the nucleic add mix using a variety of standard procedures (or RNA 
Isolation Sti:ategies,pp.55.104, inRNA Methodnln pies. A lahor...^ r.,.^. 
Isolation and Charaderization 2nd edition, 1998,RobertE. Farrell, Jr., Ed., 
Academic Press); in the present use using a filter^ased RNA isolation system 
from Ambion (RNAqueous™, Phenol-free Total RNA Isolation Kit, Catalog 
#1912, version 9908; Austin, Texas). SpedficRNAs are amplified using message 
spedfic primers or random primers. The spedfic primers are synthesized from 
data obtained from pubUc databases (e.g., Unigene, National Center for 
Biotedmology Information, National Library of Medidne, Bethesda, MD), 
induding information from genomic and dDNA Kbraries obtained fhim hmnans 
and other animals. Primers arediosen to preferentially amplify from spedfic 
RNAs obtained from the test or indicator samples, see, for example, RT PCR, 
Oiapter 15 in RNA MeftodoloWes A Laboratory t^^u^^.^ 
■ Characterization , 2nd edition, 1998,Robert E. Farrell, Jr., Ed., Academic Press; or 
Chapter 22 pp.l43-151,RNAIsobtion and Q^arart.ri..H..jP^^ 
inMolecular Biology, Volume 86, 1998, R. Rapley and D. L. Manning Eds., 
Human Press, or 14 in Statistical refinement of primer design parameters,' 
Chapters, pp55r72,PCRAppUcations: Protocols for Fundional Genomic, 
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M.A.Irmis, P.H. Gelfand and ].]. Siiinsky, Eds., 1999, Acad^c Press). 
Amplificatioiis are carried out in either isothermic conditions or using a thermal 
cycler (for example, a ABr9600 or 9700 or 7700 obtained from PE Biosystems, 
Foster City, CA; see Nucleic Add Detection Methods, pp. 1-24, in Molecular 
Methods for Virus Detection. D.L.Wiedbrauk and D:H., Parkaa, Bda.^ 1 QQ^;^ 
Academic Press). Amplified nucleic adds are detected using fluorescent-tagged 
detection primers (see, for example, Taqman™ PGR Reagent Kit, Protocol, part 
number 402823 revision A, 1996, PE Applied Biosystems, Foster City CA.) tiiat 
are identified and synthesized from publidy known databases as described for 
the amplification primers. In the present case, amplified DNA is detected and 
quantified using tiie ABI Prism 7700 Sequence Detection System obtained from 
PE Biosystems (Foster Qty, CA). Amounts of specific RNAs contained in the 
test sample or obtained fix>m the indicator cell lines can be related to the relative 
quantity of fluorescence observed (see for example. Advances in Quantitative 
PGR Technology: 5' Nudease Assays, Y.S. lie and C.J. Petropolus, Current 
Opinion in Biotedmology, 1998, 9:43-48, or Rapid Thermal Cycling and PGR 
Kinetics, pp. 211-229, chapter 14 in PCR Applications: Protocols for Functional 
Genomics, M A Jhnis, DJHL Gelfand and J.J. Sninsky, Eds., 1999, Academic Press. 
Example 2. Different inflammatory stimuli give rise to different baseline 
profile data sets so that the calibrated precision profiles for different agents in 
the same class of anti-inflammatory result in different signature profiles. 

Figure 11 documents die usefulness of different inflammatory stimuli to 
give rise to different, baseline profile data sets so that the calibrated precision 
profile data sets for the three anti-inflammatory agents tested zestdt in different 
signature profiles. The diff^erent profiles reflect the difference in the molecular 
targets and mechanisms of action of the three agents derived from a single class 
of flierapeutics, anti-inflammatory agents. The figure also illustrates the 
extraordinary range of detection (y-axis) from less than 10 fold difference from 
the calibrated profile to plxis or minus 10E13 increase or decrease in gene 
expression when compared to the calibrator. Comparison to the calibrator 
resxdts in gene expression profiles that are increased, decreased, or without 
diange from the calibrated set. 
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Figure 11(a) shows relative gene expression (mRNA synthesis) in heat- 
killed staphylococci (HKS)-stimulated cells, and the effect of three di^rent 

compounds (TPCK, UT-77, and 'TW^ or dexamethasone). Compound TPCK 
caused a 10-fold decrease in relative IFN-yexpression, and 100,000-f61d . 
decreases in IL4 and IL-5 expression. Further, compound UT-77 caused even 
greater magnitude of increases in relath^e expression of the gene encoding IL-5, 
and more modest increases in IL-1 expression (more than lO-f old) and IFN-7. 
Such effects can be highly significant in disease etiologies and outcomes, and 
have predictive value concerning the usefulness as therapeutic agents of these 
compounds or similar chemical entities or chemicals that act similarly. HBCS 
cells are an in vitro model of Gram-positive bacterial infection. 

Fig. 11(b) displays analyses of expression of tiie 12 genes in 
lipopolysaccharide-(LPS)-treated cells, an in vitro model of Gram-negative 
bacterial infection. Ihese data indude several striking contrasts to the data in 
Hg. 11(a). Thus treatment with the therapeutic agent Dex caused a striking 
decrease in expression of the ILr2 gene in LPfr-treated cells, and a strildng 
increase in JLr2 expression in HKS-treated cells. Strikingly large differences in 
gene e)q)ression in the differentiy stimulated cells can be seen for the IL4 and 
the IL-5 genes. Expression of the gene for IFN, in contrast, responded similarly 
in cells treated by either of the stimuU and any of the therapeutic agents. 

By these criteria, expression of the genes for IL-2, IL4 and IL-5 were 
observed to be candidate markers or surrogate markers in ceU model systems to 
distinguish responses of the cells to Gram-positive and Gram-negative bacterial 
infection. 

^^P^^^ -- A single therapeutic a^t f or treating a particular condition ^r. 
be differentiated from a second therap e utic agent that also treats the p artimiai. 
condition by a sipiature profile for a g i ven panel of InH 

Kgure 12 shows a calibrated profile data set for a panel having 8 
constituents that are indicative of a biological condition fliat includes 
inflammation. The profiles are shown for three different anti-inflammatory 
agents-methotrexate, medofenamate and methylprednisolone. The calibrated 
. profile data sets for each agent as shown represents a signature profile for that 
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agent. This signature profile may serve as a device for establishing quality 
control for a batch of the agent. Indeed, it is envisaged that compounds or 
classes of compound s on the market or in development may be characterized by 
a signature profile. The signature profile may be represented in a graphical 
format, more particularly as a bar graph as provided in Figure 12. For Figure 12, 
an ex vivo sample was tested. A sample of blood was taken from the subject. 
Aliquots of the sample were subjected to lipopolysaccharide (LPS) ex vivo. After 
30 minutes, the anti-inflammatory agent as indicated was added to an aliquot of 
the sample of blood and after about another 4 hours, the expression of the panel 
of genes (H-la, 11-2, U-S, H-IO, Il-12p35, Il-12p40, IH5, IFN-Gamma and TNF-a) 
was determined. Although the calibrated profile of metiiotrexate and 
medof ehamate were similar, the calibrated profile of methylptednisolone was 
substantially different. Differences may be reflective of tiie differences of the 
mechanisms or target(s) of action of this agent witiiin tiie general dass of anti- 
inflammatory compoimds. The baseline is the profile data set for 
lipopolysaccharide absent any additional agents. 

Example 4. There is relatively low variability with respect to the profile within 
a single individual over time when the calibrated precision profile is determined 
from the measu rement of gene expression across many gene lod that have been 
appropriately induced. 

Figure 13(a)(b) and (c) show a graphical representation of calibrated 
precision profile data sets for two different samples of whole blood. 
Heparinized whole blood firom a single normal healthy volunteer was collected 
on two separate occasions of more than 2 weeks apart Figure 13a for sample 
991116 and Figure 13b,for sample 991028 reflect the biological condition of the 
tested cells from the single donor using a panel (Le., the inflammation panel) of 
24 members, in response to stimulation with one of three different agents. The 
baseline in tfiis example is derived from imtreated cells obtained firom the same 
individual. The calibrated profiles are shown for cells exposed for 4 to 6 hoiurs 
to lipopolysaccharide (LPS), heat-killed Stapylococd (HKS), and 
phytohemagglutinin (PHA). Figure 13c shows a direct comparison of LPS- 
stimulated blood sample 99116 with respect to blood sample 991028, Le., 991028 
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is used as the calibrator or baseline proffle data set. Ihe messenger RNA levels 
measured on 10/28/99 were used to compare the levels of messenger RNA 
measured on 11/16/99. A perfect identity of RNA levels would be represented 
byaflatlineatunity. Utese data clearly show that for baseline gene expression, 

th«e can be as much as an 8 fold difierence (o-jun) in messenger RNA levels. 
However, for most of the genes measured, the levels of messenger RNA 
measured on one day are within 2-3 fold of those measured on a different day. 
13(d) is similar to 13(c) except that the cells were not stimulated with LPS. 

The figure documents the relatively low variabiUty with respect to the 
profile within a single individual over time when ttxe calibrated precision profile 
is determined from Ae measurement of gene expression across many gene lod 
thathavebeenappropriatelyinduced. Ihe figure illustrates (1) the dass-specific 
effects (generaUy inflammatory as determined by Ae effect on pro-inflammatory 
gene lod, e.g., TNI^alpha, IL-l alpha and IL-l beta), (2) flte agent-spedfic effects 
quantitative differences between eadi of the agents at the same gene lod (e.g., 

IL-2) and (3) reprodudble and therefore predictable effects on the subject 
population, IK (Kgure 13c) 

Similarities and differences in th. .ff ^ pf ^ .jn^ lp 
populations differing in th eir biolo^ Vfll rnnHiH^» 

Ex-vivo gene expression analysis can be performed by obtaining the blood 
of a subject for example by drawing the blood into a vacutainer tube with 
sodium heparin as an anticoagulant An anti-inflammatory sudi as 3-methyl. 
prednisolone at a final concentration of 10 mioomolar was added to blood in a 

polypropylene tube, incubated for 30 minutes at 37C. in 5% CO2 . After 30 
minutes a stimuH suA as LPS at 10 ng/mL or heat killed staphlococcus (HKS) at 

1:100 dilution was addedtothe drug treated whole blood. Incubation continued 
at 37C. in 5% CO2 for 6 hours unless otherwise indicated. Erythrocytes were 

lysed in RBC lysis solution (Ambion) and remaining cells were lysed according 
to the Ambion RNAqueous-Blood module (catalog # 1913). RNA was eluted in 

Ambion elution solution. RNA was DNAsed treated with 1 unit of DNAse I 
(Ambion #2222) inlXDNAsebufferat37C.for30minutes. Firststrand 

-qmthesis was performed using the P«ldn.Elmer TaqMan Reverse 
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kit with MiiltiScribe reverse transcriptase (catalog # N808-0234). Quality check 
of RT reactions were performed with Taqman PCR chemistry iising the 18S 
rRNA pre-developed assay reagents (PDAR) from PE Biosystems (part 
#4310893E). PCR assay was performed on 6 to 24 genes in foxir replicates on the 
FE Biosystems 7700. PCR assays were performed according to specifications 
outiined with the PDAR product. Relative quantitation of the gene of interest 
was calibrated against 18S rRNA expression as described in PE product User 
Bulletin 2 (1997) and elaborated in Hirayama, et al (Blood 92, 1998:46-52) using 
18S instead of GAPDH. 

Relative quantitation of the mRNA was measured by the difference in 
threshold cycles between IBS and the gene of interest. This delta Cx was then 
compared to the normalizing condition, eitiier subject before treatment, or 
stimuli without drug in an ex-vivo assay to measure ''fold induction'' 
represented in Hie bar graphs. (Figure 14) For example in the above graph, IFN- 
levels are 1/50 less on day 3 than before treatment. 

Example 6 . In vivo and Ex vivo samples provide comparable signature profiles. 
Figure 15 shows the calibrated profile data set for two siibjects (Subject 1 
and Subject 2) who have been treated over a three day period with a standard 
dose of the corticosteroids, dexamethasone. Blood from each subjects was 
obtained 72 hours later and a quantitative measure of the amount of RNA 
corresponding to the panel constituents was determined. Although, the 
calibrated profile data set for each subject was similar for most gene lod, some 
notable differences were also detected, for example for 11-2, 11-10, 11-6 and GM- 
CSF. A calibrated profile data set is also shown for comparison for an ex vioo 
sample of blood firom sample 1 prior to treatment with corticosteroid where the 
ex vivo sample is subjected to an equivalent amotmt of corticosteroid in vitro as 
calculated to be the plasma level in the subject. The similarity in the calibrated 
profile data set for ex vivo samples when compared to in vivo samples provides 
support for an in vitro assay that will predict the in vivo action of the compoxmd. 
We have observed a similsir comparable effect between in vivo and ex vivo 
samples infected with an infectious agent, more particularly bacterial or viral 
agents. We have concluded therefore that the ex xnvo samples provide an 
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effective method Of detenniiung the effect Of a Single compos 
compounds on a patient, where the multiple compounds may be either used in 
combination, in parallel or sequentially to optimize the selection of an agent for 
a biological condition for ti»e subject 

S^aSQEkZ. Demonstaration of repiodtirihilifY of an in wtm iv>^ »co 
approved anti-inflaTnTnatorv on 5 diffe rent donor ta^h^ 

Comparison and analysis of tiie Rgures 18a tiirough 18e demonstrates 
the consistency of effect of the stimulus and in vitro treatment with an approved 
anti-inflammatory on 5 difierent donors (each figure representing a unique 
donor). Ihe use of a known and tested stimulus results in a highly reproducible 
gene response in vitro that may be correlated with a predictable in vivo response. 
Figures 18a-18e provide the results of analysis of 5 donors from which a blood 
sample has been taken, llie blood samples were exposed to a therapeutic agent 
at various concentrations ranging ftom O.lpM tb 5mM, more particularly O.lpM, 
0.3mm, lnM,3MMand5MM,fora4hourperiod. EWfferent concentrations of tiie 
drug resulted in a calibrated proffle data set foLaaManunationpanel at each 
concentration that was quaKtatively different from tihe next Figure 18a 
corresponds to donor 1, Figure 18b corresponds to donor 2, Hgure 18c 
corresponds to donor 3, Figure 18d corresponds to donor 4 and Figure 18e 
corresponds to donor 5. Each individual varied from the other and also 
provided a variable profile for a different concentration. This set of figures 
illustirates the high level of information obtainable by calibrated profile data sets. 
^^P^^^ - A cah^rated profile data set rrx^y provide a ri^ .f,i,e profile fnr ^ 
complex mixtiirp rtf comp ounds. 

Figure 21 illustiates the effect of three different anti-inflammatory herbs 
on a panel of constituents including constituents of an inflammatory panel 
(INF-a, n-lb, ICAH n-8, n-lO, Il-12p40, ICE. cox-2, cox-l and mmp^) fi cell 
growth and differentiation panel (c-fos, c-jun and STA13) , a toxicity panel 

(SOD-1, TACE, C3R, HSP70, GST, c-fos, o^un, INOS) and a Uver metabolism 
panel (INOS, cyp-a and u-pa). llie cells assayed in Figure 21 are aHquots of 
blood from a subject that are exposed ex vivo to lipopolysaccharide and to 
Echinacea (SPM9910214) Arnica (SP^^^ 
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(SPM9910074), ead\ of the nutraceuticals being applied to the blood sample at 

*) 

thesamecx)ncentrationof 200ug/ml. The baseline is cell sample with 
lipopolysaccharide in the absence of a nutraceutical. Eachnutraceutical(fonned 
from a complex mixture) has a characteristic signature proffle just as did the 
single compotmd pharmaceutical anti-inflammatory agents. The signature 
profile may be provided in a graphic form that can be use to identify a herbal 
while providing information concerning its properties and its efficacy for a 
single subject or for an average population of subjects. 
Example 9. A quality control assay for Echinacea brands using calibrated 
profile data sets. 

Figure 24 shows a graphic representation of the calibrated profile data 
sets for four different commercial brands of Echinacea Brands using an 
Inflammation Panel. As expected, SPM007 and SPM003 gave Ae signature, 
calibrated profiles similar to authentic Echinacea samples SPMOlO and SPM 016, 
. although labded and sold as Echinacea when tested using ftie system described 
in Figure 14, resulted in signature calibrated profiles that were substantially 
similar to the profile obtained with lipopolysaccharide alone. Echinacea 
samples SPMOlO and SPM016 were fbimd to have elevated, highly biologically 
active levels of endotoxin while the LPS levels in SP007 and SP003 were 
undetectable. A stored signature profile for active echinacea obtained from a 
panel designed to test efficacy and mode of action, e.g., the inflammation panel, 
permits evaluation of new batches of Echinacea, differentiation of existing or 
new brands of Echinacea, guide flie isolation and development of new 
compoimds with different or similar activities from a complex compoumd like 
Echinacea or may be used in the development of quality assurance in flie 
production, analysis and sale of new or previously marketed compoimds. In 
the example dted, two of the brands of Echinacea SPOlO and SP016.result in 
calibrated profiles that are diaracteristic of authentic Echiiiacea. 
Example 10. Comparison of three herbal preparations using an indicator cell 
line. 

Figures 25 (aHc) provide calibrated profile data sets for three herbal 
preparations with respect to an indicator cell line (THP-l) rather than a blood 
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sample from a subject. In Kgure 25(a), the baseline is the profile data set for 
THP-1 cells absent the herbal while the histograms represent the calibrated 
profile data sets for the same herbal from three different manufacturing sources 
ofthesameherbat250ug/mL Gene expression results are shown on a log 
scale. Similar to the observation in Figure 14, these demonstrate that similarly 

labeled compounda obtained from different sources have demonstrable and 
quantifiable differences in calibrated profiles using a specific panel, eg. the 
inflammation panel designed to obtain information about the expression of 
gene products related to inflammation and infection. This suggests that the 
compounds likely have different efficacies when used for specific purposes. 

Figure 25(b) provides a comparison of the calibrated profile of a single 
herb at three concentrations using the indicator ceU line of THP-1. The baseline 
profile data set is untreated THF-l cells. Analysis of the data suggests a 
concentration-dependent response in the indicator cell lines which, alflwugh 
demonstrated here, may be indicative of a similar response in subjects. 

Figure 25(c) provides a comparison of four commercial Echinacea brands 
used at the same concentration and tested against a panel of constituents using 
a THP-1 ceU line as an indicator cell population. Differential expression, as 
revealedby differences in the calibrated profiles, allows direct comparisons of 
complex ooiiqwunds to be made. For example, analysis of the differences in the 
calibrated profiles could be used to guide compound isolation and 
development, product differentiation in the marketplace, or used by the 
consumer or health professional to guide the individualized choice of a single 
compound fi:om a class of similar compounds that may be suited for a 
particular biological condition. 

Alti^ough various embodiments of tiie invention have been disclosed, it 
should be apparent to those skilled in the art that various changes and 
modifications can be made which will achieve some of the advantages of the 
invention withoutdepartingfcomthetruescopeoftheinvention. Iheseand 
other obvious modifications are intended to be covered by the appended claims. 
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1. A method, for evaluating a biological condition of a subject, 
comprising: 

a. obtaining from the subject a sample having at least one of 
RNAs and proteins; 

b. deriving from the sample a first profile data set, the first 
profile data set including a plurality of members, each member being a 
quantitative measure of the amount of a distinct RNA or protein constituent in a 
panel of constituents selected so that measurement of the constituents enables 
measurement of the biological condition; and 

c. producing a calibrated profile data set for the panel, 
whdrein each member of the calibrated profile data set is a function of a 
corresponding member of the first profile data set and a corresponding member 
of a baseline profile data set for the panel, the calibrated profile data set 
providing a measure of the biological condition of the subject. 

2. A method, for evaluating a biological condition of a subject, 
comprising: 

a. obtaining from the subject a first sample having at least one 
of fluid, cells and active agents; 

b. applying the first sample or a portion thereof to a defined 
population of indicator cells; 

c. obtaining from &e indicator cells a second sample 
containing at least one of RNAs or proteins; 

d. deriving from Oie second sample a first profile data set, the 
first profile data set including a plurality of members, each member being a 
quantitative measure of ttie amount of a distinct RNA of protein constituent in a 
panel of constituents selected so that measurement of the constituents enables 
measurement of the biological condition; and 

e. producing a calibrated profile data set for the panel, 
wherein each member of the calibrated profile data set is a function of a 
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corresponding member Of the first pioffle data set and a corres^^^ 
of a baseline profile data set for the panel, the calibrated profile data set 
providing a measure of the biological condition of the subject. 

3. A method, for evaluating a biological condition affected by an 
agent, the metiiod compriang: 

a. obtaining, from a target population of cells to which the 

agent has been administered, a sample having at least one of ENAs and 
proteins; 

b. deriving fix)m the sample a first profile data set, the first 
profile data set including a pluraUty of members, each member being a 

quantitative measure of the amount of a distinct RNA or protein constituent in a 
panel of constituents selected so that measurementof the constituents enables 
measurement of the biological condition; and 

c. produdnga calibrated profile data set for the panel, 
wherein each member of the calibrated profile data set is a function of a 

correqKJnding member of the first profile data set and a corieqjonding men^^^ 
of a baseline profile data set for the panel, the calibrated profile data set 
providing a measure of the biological condition as afected by the agent 

4. A metiiod according to any of claims 1 through 2, wherein the 
baseline profile data set is derived bom one or more other samples from the 
same subject taken under conditions different from those of the sample. 

5. A method according to daim 4, wherein the conditions are 
selected firom the gioup consisting of (i) the time at whidi a given sample is 
taken, (ii) the site fix>m whidi a given sample is taken, (iii) the physiological 
condition of the subject when a gjven sample is taken. 

6. Amethodaccordingtodaim4,whereintheoneormoreother 
samples are taken over an interval of time that is at least twehre months between 
an initial sample and fhe sample. 
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7. A method according to claim 4, wherein the one or more other 
samples are taken over an interval of time that is at least one month between an 
initial sample and the sample. 

8. A method according to any of claims 1 through 3, wherein the 
sample is derived from blood and the baseline profile data set is derived from 
tissue or body fluid of the subject other than blood. 

9. A metihod according to claim 4, wherein the baseline profile data 
set is derived from one or more other samples from the same subject, taken 
when the subject is in a physiological condition different from that in which the 
subject was at the time the sample was taken, with respect to at least one of age, 
diet, medication, and environmental e)q>osure. 

10. A method according to daim 3, wherein flie baseline profQe data 
set is derived from one or more other samples from the same population taken 
under conditions different from those of the sample. 

11 . A method according to claim 10, wherein the conditions are 
selected from the group consisting of (i) the time at which a given sample is 
taken and (ii) the physiological condition of the population when a given 
sample is taken. 

12. A method according to daim 10, wherein the one or more other 
samples are taken over an interval of time tiiat is at least tweh^e montiis between 
an initial sample and the sample. 

13. A metiiod according to daim 10, wherein tiie one or more other 
samples are taken over an interval of time that is at least one month between an 
initial sample and the sample. 
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14. A method according to claim 10, wherein the sample is derived 
from blood and ttie baseline profile data set is derived from tissue or body fluid 
of the subject other than blood. 

15. A method according to claim 10, wherein the baseline profile data 
set is derived from one or more other samples of cell populations associated 
witii a common subject, the populations taken when tiie subject is in a 
physiological condition different from that in which tiie subject was at the time 
the sample was taken, with respect to at least one of age, diet, medication, and 
environmental exposure. 

16. A method according to any of claims 1 and 2, wherein the baseline 
profile data set is derived from one or more other samples from one or more 
different subjects. 

17. A method according to daim 16, wherein the one or more different 
subjects have in common with the subject at least one of age group, gender, 
ethnicity, geographic location, diet, medical disorder, clinical indicator, 
medication, physical activity, body mass, and environmental ej^osure. 

18. A method according to claim 3, wherein the baseline profile data 
set is derived from one or more other samples from one or more cell populations 
associated widi different subjects. 

19. A melhod according to daim 18, wherein the one or more different 
subjects have in comtAon witii the subject at least one of age group, gender, 
ethnidty, geographic location, diet, medical disorder, clinical indicator, 
medication, physical activity, body mass, and environmental exposure. 

20. A metiiod according to any of daims 1 through 3, further 
comprising: interpreting the calibrated profile data set in the context of at least 
one other dinical indicator. 
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21. A method according to daim 20, wherein tiie indicator is selected 
from the group consisting of blood chemistry, urinalysis. X-ray, other chemical 
assays, and physical findings. 

22. A method according to any of claims 1 through 3, wherein the 
biotogical condition is a complex disease process, involving multiple genes, the 

disease being of a type involving at least one of inflammation, auto-immune 
disease, degenerative disease, allergy, vascular disease, ischemia, cancer, 
developmental disease, hormonal condition, aging and infectious diseases. 

23. A method according to claim 22, wherein the biological condition 
is one of arthritis, asthma, multiple sderosis, and perimenopausal change. 

24. Amethod according to any of daims 1 and 2, wherein the subject 
is a living organism. 

25. A method according to daim 24, wherein flie subject is a mammal. 

26. A method according to daim 3, wherein the population of cells is 
human cells. 



27. A method according to daim 3, wherein the population of cells is 
mammalian cells* 



28. Amethodacoordingtoanyofdaimslthrough3,whereinthe 
sample is derived from one or more of body fluid and tissue. 

29. A method according to any of daims 1 through 3, wherein the 
sample is derived from blood. 

30. A method according to any of daims 1 through 3, wherein the 
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sample is derived from one of a biopsy, a needle aspirate, a lavage specimen, a 
scraping, and a surgical specimeati. 

31. A metiiod according to any of daimslthroug^i 3, wherein the 
sample is derived from tissue or flxiid of a type distinct from that with respect to 
which the condition is clinically manifested. 

32. A method according to any of claims 1 through 3, wherein the 
condition is a disease and the sample is derived from tissue or fluid of a type 
distinct from that which is a primary target of the disease. 

33. A method according to any of claims 1 through 3, wherein the 
function is other than a simple difference. 

34. A mettiod according to daim 33, wherein the function is a second 
function of the ratio of the corresponding member of first profile data set to the 
corresponding member of tiie l?aseline profile data set. 

35. A method according to claim 34, wherein the function is a 
logarithmic functioru 

36. A method according to any of claims 1 through 3, wherein each 
member of the calibrated profile data set is reprodudble with respect to similar 
samples taken from the siibject xmder similar conditions. 

37. A method according to any of daims 1 through 3, wherein eadi 
member of the calibrated profile data set is reprodudble within one order of 
magnitude with respect to similar samples taken from the subject under similar 
conditions. 

38. A method according to any of daims 1 through 3, wherein each 
meinber of the calibrated profile data set is reprodudble within fifty percent 
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With respect to similar samples taken fix>m the subject under similar conditions. 

39. A method according to any of claims 1 through 3, wherein each 
member of the calibrated profile data set is reproducible within twenty percent 
with respect to similar samples taken from the subject under similar conditions. 

40. A method according to claim 34, wherein each member of the 
calibrated profQe data set is reproducible within one order of magnitude with 
respect to similar samples taken from the subject under similar conditions. 

41. A method according to daim 34, wherein each member of the 
calibrated profile data set is reproducible within fifty percent witti respect to 
similar samples taken fix>m the subject under similar conditions. 

42. Amethodaccordingtoclaim34,whereineachmemberofthe 
calibrated piofife data set is reproducible within twenty percent wifli resp 
similar samples taken fi»m the subject under similar conditions. 

43. A method according to claim 34, wherein each member of the 
calibrated profile data set has biological significance if it has a value differing by 
more than an amount D, where D = F(l.l) - F(.9), and F is the second function. 

44. A method according to any of claims 1 through 3, wherein the 
biological condition concerns an organ and fte panel of constituents enables 
measurement of the condition in relation to the organ. 

45. A method according to any of claims land 2, wherein the 
biotogical condition concerns a system of the subject, tiie system selected from 
tiie group consisting of Inspiratory, vascular, nervous, metaboUc, urinary, 
reproductive, stiructural, and immunological systems, and the panel of 
constituents enables measurement of the condition of the subject in relation to 
ti\e system. 
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46. A metiiod according to claim 3, wherein the poptilation of cells is 
derived from a subject and the biological condition concerns a system of the 
subject, flie system selected from the group consisting of respiratory, vascular, 
nervous, metabolic, luinaiy, reproductive, structural, and iiruntmological 
systems, and the panel of constituents enables measurement of the condition of 
the subject in relation to the system. 

47. A metfiod according to daim 46 and the panel includes at least half 
of the constituents of the Inflammation Panel. 

48. A method according to claim 46 and the panel includes at least 
eighty percent of the constituents of the Inflammation Panel 

49* A method according to daim 46 and the panel indudes at least half 
of tiie constituents of the Cell Growth and Differentiation Panel 

50. A method according to daim 46 and the panel indudes at least 
eighty percent of the constituents of the Cell Growth and Differentiation Panel 

51. A method according to daim 46 and the panel indudes at leeist half 
of the constituents of a Toxidty Pand. 

52. A method according to daim 46 and the panel indudes at least 
eighty percent of the constituents of a Toxidty Panel. 

53. A method according to any of claims 1 through 3, wherein ttie 
number of constituents in the pand is at least three but less tiian 100. 

54. A method according to any of daims 1 through 3, wherein the 
mmiber of constituents in the panel is at least four but less than 100. 
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55. A method acxording to any Of daiiisl through 3, wherem the 
number of constituents in the panel is at least at least five but less than 100. 

56. A method according to any of claims 1 through 3, wherein the 
number of constituents in the panel is at least is at least sbc 

57. A method according to claim 3, wherein the agent is selected from 
the group consisting of a drug, a mixture of compounds, a functional food, a 
nutraceutical, a therapeutic agent, an allergen, and a toxin. 

58. A method according to any of daimsl through 3, wherein 

deriving the first profile data set from the sample includes hybridizing fltt 
sample with a set of nudeic add probes. 

59. A method according to daim 58, wherdn fte probes are attadied 
toaninsolubkmatrfacaiid.fliesamplieisappHedtothematrfac 

60. A method according to daim 3, wherein evaluating the coi^dition 
affeded by the agent indudes evaluating the interaction of the agent with a 
second agent administered to the population of cells. 

61 . A method according to daim 60, wherein the interadion is neutral 

62. A method according to daim 60, wherein the interaction is 
interference. 



63. A method according to daim 60, wherein the interaction is 
cumulative. 



64. A meAod according to daim 60, wherein the interaction is 
synergistic. 
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65. A method according to claim 60, wherein the agent is a 
pharmaceutical. 

66. A metihod, for evaluating the efifect on a biological condition by a 
first agent in relation to the effect by a second agent, tiie method comprising: 

a. obtaining, from first and second target populations of cells 
to which the first and second agents have been respectively administered, first 
and second samples re^ectivety, each sample having at least one of RNAs and 
proteins; 

b. deriving from the first sample a first profile data set and 
fi:om the second sample a second profile data set, the profile data sets each 
including a plurality of members, each member being a quantitative measure of 
the amount of a distinct RNA or protein constituent in a panel of constituents 
selected so tiiat measurement of the constitt^ts enables measurement of tiie 
biological condition; aiul 

c. producing for the panel a first calibrated profile data set 
and a second profile data set, wherein (i) each member of the first calibrated 
profile data set is a function of a corresponding member of the first profile data 
set and a corresponding member of a first baseline profile data set for the panel, 
and (ii) each member of tire second calibrated profile data set is a function of a 
corresponding member of the second profile data set and a corresponding 
member of a second baseline profile data set for the panel, the calibrated profile 
data sets providing a measure of flie effect by the first agent on tite biological 
condition in relation to ttie effect by flie second agent. 

67. A method according to daim 66, v^herein the first agent is a drug 
and the second agent is a complex mixture. 

68. A metiiiod according to daim 66, wherein the first agent is a drug 
and tile second agent is a nutriceutical. 

69. A metiiod according to any of daimsl through 3, wherein 
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Obtaining the sample and quantifying the first profile data set are peribrmed at a 
first location, and producing tite calibrated profile data set includes using a 
network to access a database stored on a digital storage medium in a second 
location. 



70. A method according to daim 69, further comprising updating the 
da;tabase to reflect the first profile data set quantified from the sample. 

71. A method according to daim 69, wherein using a network indudes 
accessing a global computer network. 

72. A method of conducting a dinical trial of an agent, the method 
comprising: 

a. causing the blind administration of a selected one of a 
placebo and the agent to eadi candidate of a pool of subjects; and 

b. using quantitative gene expression to monitor an effect of 
such adminisfiratioit. 

73. A method according to claim 72, wherein the pool of subjects is 
selected using quantitative gene expression analysis on a pluraUty of candidates 
to identify those candidates likely to show a response to the a^ent. 

74. A method according to daim 72, wherein the administration 
indudes determining at least one of a dosage and a dosage range by using 
quantitative gene expression analysis. 

75. A method according to daim 72, further comprising using 
quantitative gene expression analysis to assist in determining at least one of 
efficacy and toxidty of the agent 

76. A method according to any of daims 72 through 75, wherein using 
quantitative gene expression analysis indudes using the method of at least one 
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of claims 1, 2, and 3. 

77. A digital storage medium on which is stored a computer readable 
calibrated profile data set, wherein: 

a. the calibrated profile data set relates to a sample having at 
least one of KNAs and proteins derived from a target cell population to which 
an agent has been administered; and 

b. the calibrated profile data set includes a first plurality of 
members, each member being a quantitative measure of a change in an amount 
of a distinct RNA or protein constituent in a panel of constituents selected so 
that measurement of the constituents enables measurement of a biological 
condition as affected by administration of the agent 

78. A digital storage mediiun according to claim" 77, wherein: (i) each 
member of tiie calibrated profile data set is a function of a corresponding 
member of a post-administration data set and a corresponding member of a 
baseline data set; (ii) each member of the baseline data set is a quantitative 
measure of the amoxmt of a distinct RNA or protein constituent in the p and 
under a normative condition; and (iii) each member of the post-administration 
data set is a quantitative measure of the amoimt of a distinct RNA or protein 
constituent in the panel under a condition following administration of the agent 
to the target cell population. 

79. A medium according to daim 78, wherein the function is a second 
function of the ratio of the corresponding member of baseline data set to the 
corresponding member of the post-administration data set. 

80. A medium according to claim 79, wherein the second function is a 
logarithmic function. 

81. A digital storage medium according to any of claims 77-80, 
wherein the agent is a pharmaceutical 
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82. A digital Storage mecUmnaocoiding to any of claims 77^, 
whereintheagentindudesasecondpluraUtyofcomponents. ' 

83. A digital storage medium according to any of claims 77-80, 
wherein the agent is a nutraceuticaL 

84. A digital storage medium according to any of claims 77-80, 
i^erein the first pluraHty is at least three but less than 1000. 

85. A digital storage medium according to any of claims 77-80, 
wherein the first pluraKty is at least four but less than 1000. 

86. A digital storage medium according to any of claims 77-60, 
wherein the first plurality is at least five but less than 1000. 

87. A digital storage medium according to any of claims 77-80, 
wherein the first pluraUty is at least SDC 

88. AdigitalstoragemediumonwhichisstoredapluraHtyofrecords 
R, relating to a population of subjects, each record R, corresponding to a distinct 
instance P, of a computer readable profile data set P wherein: 

a. each instance P, of the profile data set P relates to a distinct 
sample derived from a subject, the sample having at least one of RNAs and 
proteins; 

b. profile data P set includes a pluraUty of members 
each member M, being a «juantitative measure of the amount of a distinct RNA 
or protein constituent in a panel of constituents selected so that measurement of 
the constituents enables measurement of a biological condition; 

c each record R, includes, for each member Al,-of a 
corresponding distinct instance P, of the profile data set P, a valle 
corresponding to the value of the member and 
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d. each record also includes a reference to a characteristic of 
the subject relative to the record, the characteristic being at least one of age 
group, gender, ethnicity, geographic location, diet, medical disorder, dirucal 
indicator, medication, physical activity, body mass, and enviroiunental 
exposure. 

89. A digital storage meditmi according to claim 88, wherein each 
sample is derived from a target cell population to which has been administered 
an agent, such target cell population being derived from a subject. 

90. A digital storage medium on is stored a large number of computer 
readable profile data sets, wherein: 

a. each profile data set relates to a sample derived from a 
target cell population to which has been administered an agent, the sample 
having at least one of RNAs and proteixts; 

b. each profile data set includes a plurality of members, each 
member being a quantitative meastu:e of the amount of a distinct RNA or 
protein constituent in a panel of constituents selected so that measurement of . 
the constituents enables measurement of a biological condition; and 

c. the panel is the same for all profile data sets. 

91. A method, for evaluating a biological condition of a subject, based 
on a sample from the subject, the sample having at least one of RNAs and 
proteins, the method comprising: 

a. deriving front the sample a fijrst instance of a profile data 
set, the profile data set including a plurality of members, each member being a 
quantitative measure of tihe amount of a distinct RNA or protein constituent in a 
panel of constituents sdected so tihat measurement of the constituents enables 
measurement of the biological condition; and 

b. producing a first instance of a calibrated profile data set for 
the panel, wherein each member of an instance of the calibrated profile data set 
is a function of a corresponding member of an instance of the profile data set 
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and a corresponding member of an instance of a baseline profile data set for the 
panel the calibrated profile data set providing a measure of the biological 
ccmdition of the subject; and 

c accessing a data in a condition database, the condition 
database having a pluraHty of records relating to a population of subjects, each 
record corresponding to a distinct instance of the calibrated profile data set; and 

d. evaluating the first instance of the calibrated profile data set 
in rdatioii to data in the condition database. 

92. A method according to claim 91, wherein accessing the condition 
database includes accessing the condition database over a network. 

93 . A method according to claim 92, wherein the network is a global 
con^uter network. 

94. A method aaprding to daim 92,further comprising: 
supplementing the conditioii database based on data associated with the first 
instance of the calibrated profile data set 

95. A method according to daim 92, wherein the biological condition 
concerns a system of the subject, the system selected torn the group consisting 
of respiratory, vascular, nervous, metaboUc, urinary, reproductive, structural, 
and immunological systems and the panel of constituents enables measurem^ 
of the condition of the subject in relation to the system. 

96. A method according to daim 92, wherein each record also 
references a diaracteiistic of the population relathre to the record, ttie 
characteristic bemg at least one of age group, gender, ethnidty, geographic 
location, diet, medical disorder, dinical indicator, medication, physical activity, 
body mass, and environmental escposvue. 
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97. A method according to claim 96, wherein the characteristic 
includes a clinical indicator. 

98. A method of di^lajnng quantitative gene expression analysis data 
associated with measurement of a biological condition, the method comprising: 

a. identifying a fiist profile data set pertinent to the gene 
expression analyas data, ttie first profile data set including a plurality of 
members, each member being a quantitative measure of the amount of a distinct 
RNA or protein constituent in a panel of constituents selected so that 
measurement of the constituents enables measurement of the biological 
condition; 

b* producing a calibrated profile data set for the panel, 
wherein each member of the calibrated profile data set is a function of a 
corresponding member of the first profile data set and a corresponding member 
of a baseline profile data set for the panel, tiie calibrated profile data set 
providing a measure of ti\e biological condition of the subject; and 

c diq>laying the calibrated profile data set in a graphical 

format. 

99. A method according to daim 98, wherein the function is a second 
function of the ratio of the corresponding member of first profile data set to the 
corresponding member of the baseline profile data set. 

100. A method according to claim 97, wherein the function is a 
logarithmic function. 

101. A metiliod according to daim 97, wherein the graphical format is a 
bar graph for each member of tiie calibrated profile data set. 

102. A descriptive record of a change in a biological condition in a 
population of cells, comprising: 

. a. . a first set of nxmierical gene expression values for a panel of 
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gene lod, each value in the set corresponding to a single gene locus in a panel of 
gene lod, the set of values forming a proffle data set for a population of cells 
subjected to a first biological condition; 

b. a second set of numerical gene expression values for ttie 

panel of gene lod, each value in the set corresponding to a single gene locus, the 
set of values forming a baseline profile data set for a second population of cells 
subjected to a second biological condition, the second set of values optionally 
being an average for multiple gene expression values from multiple populations 
of cells for each locus in the panel; and 

c. a tiiird set of numbers corresponding to the ratio of the first 
set of values and tiie second set of values witi» respect to each gene locus in the 
panel, the third set being a calibrated profile data set; the profile data set and the 
caHbrated profile data set being descriptive of the first biological condition witii 
respect to the second biological condition. 

103. A record according to daim 102, wherein the te popuktiw 
cells and tiie second or more population of cells are the same population of cells. 

104. A record according to daim 102, wherein tiie first population of 
cells and the second or more population of cells are different populations of 
cells. 

105. A descriptive record, according to any of daims 102 through 104, 
wherein a sample is obtained firom a subject, for subjecting the cells to a 
biological condition, the cell population being an indicator ceU population. 

106. A gene expression profile data set, according to any of daims 102 
through 104, wherein the population of cells is in a subject or derived fiom a 
subject 

107. A method for diagnosing a biological condition of a subject, 
comprising : 
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obtaiiung a sample from a subject; subjecting a population of cells to the 
sample and determiiung the presence of a first biological condition with respect 
to a second biological condition according to any of claims 1 through 3. 

108. A method according to claim 107, further comprising: selecting the 
subject for a dinical trial according to the biological condition of the subject, so 
as to determine predictivdy whether the subject will respond to a test 
compoimd if the compoimd has a predetermined biological activity. 

109. A metiiod according to claim 108, wherein the test compound is a 
pharmaceutical agent 

110. A method according to daim 108 where the test compound is a 
nutraceutical agent. 

111. A metiiod for diagnosing a sxisceptibility for a biological condition 
in a subject, comprising: 

a. obtaining a sample from the subject; 

b. creating a descriptive record, according to any of claims 102 
diroug^ 106, wherein tihe set of baseline values is an average of second values 
contained in a library of descriptive records for the second biological condition; 
the library containing a plurality of records grouped according to a 
predetermined biological condition; and 

c. diagnosing the susceptibility of the subject. 

112. A method for monitoring ttie progress of a biological condition, 
comprising: 

a. creating a plurality of descriptive records, according to any 
of claims 102 through 106, wherein each set of first values is determined'at 
preselected time intervals with respect to each of the other gene expression 
profiles; 
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b. comparing each calibrated proffle data set with a library of 
caKbrated profile data sets, the pluraUty of calibrated proffle data sets being 
grouped according to a predetermined biological condition; and 

c. detenniniiig the progress of the biological condition. 

113. A method of establishing the biological activity of a composition, 
comprising: 

a. selecting a popxUation of cells; 

b. subjecting the cells to the composition; and 

c determining the record according to any of claims 102 
through 106 using a standardized baseline proffle data set for the biological 
condition. . 



114. A method according to daim 113, whet«n the composition is i 
nutraceutical. 



115. A method according to daim 113, wherein the composition is i 
pharmaceutical. 



116. A method according to daim 113, wherein the composition is an 
infectious agent 

117. A method according to daim 113, wherein the composition is a 
complex mixture. 

118. A metiiod according to claim 113, wherdn establishing the 
biological activity of the composition further indudes providing a medianism of 
action for the composition. 

119. A method according to daim 113, wherein establishing the 
biological activity of the composition further indudes providing a mechanism 
for metabolism for the composition. 
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120. A method according to claim 113, wherein the composition further 
comprises a first compoimd and a second compotmd and the biological activity 
results from any of synergism, interference or neutral interaction between the 
first and second compoimd. 

121. A method according to claim 113, wherein flie compoimd further 
comprises a plurality of compounds such that the biological activity results from 
any of synergism, interference or neutral interaction between die compounds. 

122. A method according to claim 113, wherein the biological activity of 
the compound is a toxic efifect on the subject. 

123. A method of determining which therapeutic agent from a choice of 
a plurality of therapeutic ag^ts to administer to a subject so as to change a 
biological condition in a subject from a first biological condition to a second 
biological condition; comprising: 

a. subjecting a sample from the subject to each of a plurality of 
therapeutic agents; 

b. determining a descriptive record for each of the samples 
according to any of claims 102 through 106; 

c. comparing each of the calibrated profile data sets to a 
Kbrary of calibrated profile data sets; wherein the library of calibrated profile 
data sets being grouped according to a predetermined biological condition; and 

d. determining which of the therapeutic agents is capable of 
changing the first biological condition in the subject to the second biological 
condition in tiie subject. 

124. A method according to daim 122, wherein the first biological 
condition is a consequence of die adverse effects of any of an infectious agent, a 
biological warfare agent or an environmental agent and the second biological 
condition is a reversal of these adverse effects. 
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125. A method according to claim 122, wherein tfie library of 
descriptive records con^rise a medical history for a single subject or condition. 

126. A method according to daim 122, wherein tiie library of 
descriptive records comprise medical information about a pluraKty of subjects 
or conditions. 

127. A method according to daim 122, wherein the library of signature 
profile data sets consist of signature profile data sets from a plurality of subjects. 

128. A method for characterizing the biological effectiveness of a single 
batch of a composition produced by a manufacturing process, comprising: 
providing a fingerprint according to the mettiod of daim 112; and labeling the 
batch of the composition by pladng the fingerprint on each container in the 
batch. 

129. A method for accessing biological information on a digital storage 
medmm according to daim 88, comprising: making the information available to 



a user. 



130. A method according to daim 129, wherein the method further 
comprises malcing the information available to the user on any of a network. 
World Wide Web, email, internet access site or hard copy. 

131. A method according to claim 128, wherein the method further 
comprises accessing the information for loading to a second access site. 

132. A method according to daim 131, wherein the process for loading 
indudes downloading the information. 
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133. A method according to daim 129, wherein access to the 
information is controlled. 

134. A method according to claim 133, wherein the process of control 
includes the use of a password. 

135. A method according to claim 129, wherein the ttser can annotate 
the available infbrmatioiv the aimotation becoming part of the biological 
information. 

136. A method according to claim 129, wherein the iiser can add one or 
more records to the data set, the one or more records becoming part of the 
biological information. 

137. A method for consumer evaluation of a product, wherein the 
consumer evaluation is dependent on a signature profile according to daim 102, 
comprising: identifying the product using the signature profile. 

138. A method according to daim 137, wherein the product is 
promoted according to the signature profile. 

139. A computer program product for evaluating a biological condition 
of a subject or for evaluating a biological condition resulting from the use of an 
agait, induding a computer tisable medium having computer readable program 
code thereon, the computer program code; comprising: 

a. a program code for classifying a sample from the subject or 
the agent for an identifiable record; 

b. a program code for deriving a first data set, the first profile 
data set induding a plturality of members, eadi member being a quantitative 
measure of the amount of a distinct RNA or protein constituent in a panel of 
constituents selected so that measiurement of the constituents enables 
meastirement of the biological condition; the profile data set being stored in the 
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record; and 

c. a program cxjde for optionally producing a calibrated 
proffle data set for the panel, for storage in the record, each member of the 
calibrated profile data set being a function of a corresponding member of tiie 
first profile data set and a corresponding member of a baseline profile data set 
for the panel, the calibrated profile data set providing a measure of the 
biological condition of the subject. 

140. A computer system for evaluating a biological condition of a 
subject or for evaluating a biological condition resulting from the use of an 
agent, tite computer system, comprising: 

a. a classification module for classifying a sample from the 
subject or tiie agent in an identifiable record 

b. a derivative module for deriving a first data set, the first 
profile data set including a pluraUty of members, each member being a 
quantitative measure of the amount of a distinct RNA or protein constituent in a 
panel of constituents selected so that measurement of the constituents enables 
measurement of Ae biological condition; and 

c. a production module for producing a calibrated profile data 
set for the panel, wherein eadi member of the calibrated profile data set is a 
function of a corresponding member of the first profile data set and a 
corresponding member of a baseline profile data set for the panel, the cahbrated 
profile data set providing a measure of the biological condition of the subject 

141. A method for analyzing a patient for a biological condition at a 
remote site, comprising: 

a. providing a kit for measuring a profile data base for 
evaluating a biological condition, the kit including reagents for quantitative 
analysis of RNA or protein for a panel of gene lod; 

b. accessing a centialized database containing baseline profile 
data sets corresponding to the panel; 

c . determining the calibrated profile data set for the patient; 
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analyzing titie biological condition of the patient. 
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